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Application: U.S.S.N. 60/422,663, filed October 31, 2002, which application is hereby 
incorporated by reference in its entirety. 
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BACKGROUND OF THE INVENTION 

Upon exit from the thymus, CD4 and CDS single-positive T cells are thought to enter a 
naive state in which they remain until they encounter specific antigen appropriately presented by 
antigen-presenting cells (APCs). The engagement of the T cell receptors (TCR) by 

15 peptide/MHC complexes in conjunction with costimulatory ligands initiates signaling events 
leading to IL-2 production, cell cycle progression, and differentiation into effector cells. The 
differentiation fate of T cells imdergoing activation is also influenced by exogenous cytokines. 
For CD4+ T cell, IL-12 and IL-4 promote Thl and Th2 differentiation, respectively. Acquisition 
of these effector phenotypes appears to require cell cycle progression and correlates with 

20 epigenetic modification of Uneage-specific gene loci. For CD8+ T cells, although Tel and Tc2 
phenotypes can be similarly induced, priming with TCR and CD28 ligation in the absence of 
cytokines is sufficient to promote differentiation into cytolytic effector cells that produce IFN-y. 
Acquisition of lytic activity by CD8+ cells also requires cell cycle progression, implying a 
similar requirement for epigenetic modification for effector CTL-specific gene expression. A 

25 comparison of current models of differentiation for CD4+ and CD8+ T cells is depicted in 
FIGURE 4. 

In addition to the development of cytolytic activity and cytokine-producing capacity, 
effector CD8+ T cells exhibit several additional fimctional alterations compared to naive CD8+ 
T cells. Effector cells exhibit a lower threshold for TCR-mediated signaling and display a 
30 decreased dependence on CD28 costimulation. Phenotypically, they upregulate expression of 
CD44, downregulate expression of CD62L, and also modulate expression of chemokine 



receptors thought to be important for homing to specific tissues. Effector, but not naive, CD8+ 
cells become susceptible to inhibition by CTLA-4, and to induction of anergy, and appear to 
become inhibitable by other negative regulatory pathways, such as PD-1, and killer inhibitory 
receptors. Thus the primed effector CD8+ phenotype is rather complex and extends beyond 
5 effector function per se. 

Although the naive T cell state was thought to be "quiescent" and relatively inert, two 
lines of recent data have begun to change that perspective. First, several groups have observed 
that signaling through the TCR from self peptide/class I MHC complexes is necessary to 
maintain the siurvival of naive CD8+ T cells in the periphery. Thus, the prevention of apoptotic 

10 death in naive cells is an active process. Second, the transcription factor LKLF has been shown 
to be preferentially expressed in naive T cells and downregulated after activation. In mice, post- 
thymic LKLF-deficient T cells fail to enter a naive resting state and rapidly die, suggesting that 
the naive state may need to be actively maintained. 

The mechanism behind the differentiation that produces such functional distinctions 

15 between naive and effector CD8+ T cells is not well-understood, nor are the reasons for such 
functional distinctions. A detailed understanding of the molecular events and changes in gene 
expression that occur during the transformation of a naive cytotoxic T cell into a CTL may 
provide a better basis on which to develop therapeutics and diagnostics for diseases with a cell- 
mediated immune component, as well as for disorders of the cell-mediated immune system. 

20 Diseases or disorders in which the cell-mediated immune system plays a central role include 
viral infections, parasitic infections, graft rejection, autoimmunity and cancer. Disorders of the 
cell-mediated immune system include various cancers of cytotoxic T cells, such as T cell 
lymphoma, and immunodeficiency disorders, such as common variable immunodeficiency. 

25 SUMMARY OF THE INVENTION 

The present invention relates to novel genes and/or the encoded gene products identified 
as being differentially expressed during CD8+ T cell priming by gene expression profiling. The 
present invention also relates to novel panels of molecular targets comprised of groups of genes 
and/or the encoded gene products that have been identified as being differentially expressed 

30 during CD8+ T cell priming by gene expression profiling. In one embodiment, the panels of 
genes may be comprised of at least one of the genes in FIGURE 1 that are differentially 
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expressed during CD8+ T cell priming. In other embodiments, subject panels may comprise 
subsets of the genes in FIGURE 1. For example, FIGURE 2 comprises genes with known 
functions from FIGURE 1 . A panel of genes, for example, may comprise the genes or gene 
products of FIGURE 2, or a subset thereof. In certain embodiments, groups of dififerentially- 
5 expressed genes with a related biological function may comprise panels of the invention. 
Expression of certain genes increased upon priming, and thus a panel in certain embodiments 
may be comprised of at least one gene whose expression is upregulated during CD8+ T cell 
priming. Likewise, in other embodiments, a panel may be comprised of at least one gene whose 
expression is downregulated during CD8+ T cell priming. The novel panels of the present 
10 invention may also be comprised of the gene products of the panel genes, for example, mRNAs 
and proteins. 

The present invention further relates to the use of the novel panels in methods of 
screening candidate therapeutic agents for use in treating disorders of CDS-i- T cell priming or 
diseases in which CD8+ T cell priming is a component. In one embodiment of the invention, the 

15 disorder is graft rejection. In another embodiment of the invention, the disorder is cancer. In 
some embodiments, candidate therapeutic agents, or "therapeutics", are evaluated for their abihty 
to bind a target protein. The candidate therapeutics may be selected, for example, from the 
following classes of compounds: proteins, peptides, peptidomimetics, small molecules, 
cytokines, or hormones. In other embodiments, candidate therapeutics are evaluated for their 

20 ability to bind a target gene. The candidate therapeutics may be selected, for example, from the 
following classes of compounds: antisense nucleic acids, small molecules, polypeptides, 
proteins, peptidomimetics, or nucleic acid analogs. In some embodiments, the candidate 
therapeutics may be in a library of compounds. These libraries may be generated using 
combinatorial synthetic methods. In certain embodiments of the present invention, the ability of 

25 said candidate therapeutics to bind a target protein may be evaluated by an in vitro assay. In 

embodiments of the invention where the target of the candidate therapeutics is a gene, the ability 
of the candidate therapeutic to bind the gene may be evaluated by an in vitro assay. In either 
embodiment, the binding assay may also be in vivo. 

The present invention further contemplates evaluating candidate therapeutic agents for 

30 their ability to modulate the expression of a target gene by contacting the CD8+ T cells of a 

subject with said candidate therapeutic agents. In certain embodiments, the candidate therapeutic 
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will be evaluated for its ability to normalize the level of expression of a gene or group of genes 
involved in CD8+ T cell priming. The candidate therapeutics may be selected from the 
following classes of compounds: antisense nucleic acids, ribozymes, siRNAs, dominant negative 
mutants of polypeptides encoded by the genes, small molecules, polypeptides, proteins, 
5 peptidomimetics, and nucleic acid analogs. 

Alternatively, candidate therapeutic agents may be evaluated for their ability to inhibit 
the activity of a protein by contacting the CD8-I- T cells of a subject with said candidate 
therapeutic agents. In certain embodiments, a candidate therapeutic may be evaluated for its 
ability to inhibit the activity of a protein that normally promotes CD8+ T cell priming. In other 

10 embodiments, a candidate therapeutic may be evaluated for its ability to inhibit the activity of a 
protein that normally if inhibited promotes CD8H- T cell priming. 

Furthermore, a candidate therapeutic may be evaluated for its ability to normalize the 
level of turnover of a protein encoded by a gene from the panels of the present invention. In 
another embodiment, a candidate therapeutic may be evaluated for its ability to normalize the 

1 5 translational level of a protein encoded by a gene from the panels of the present invention. In yet 
another embodiment, a candidate therapeutic may be evaluated for its ability to normalize the 
level of turnover of an mRNA encoded by a gene from the panels of the present invention. 

Assays and methods of developing assays appropriate for use in the methods described 
above are known to those of skill in the art and, as will be appreciated by those skilled in the art, 

20 may be used as suitable with the methods of the present invention. 

The efficacy of candidate therapeutics identified using the methods of the invention may 
be evaluated by, for example, a) contacting CD8+ cells of a subject with a candidate therapeutic 
and b) evaluating its abiUty to normalize the level of CD8+ T cell priming in the subject's cells 
using assays directed to evaluating the level of CD8+ T cell priming. If a candidate therapeutic 

25 is shown by assay to induce a high level of CD8+ T cell priming, then the candidate may be 
considered a CD8+ T cell priming enhancing drug. Conversely, if a candidate therapeutic is 
shown by assay to inhibit the level of CD8+ T cell priming, then the candidate may be 
considered a CD8+ T cell priming inhibiting dmg. Altematively, the efficacy of candidate 
therapeutics may be evaluated by comparing the expression levels of one or more genes 

30 differentially expressed during CD8+ T cell priming in a CD8+ T cell of a subject having a 
disorder of CD8+ T cell priming, or a disease in which CD8+ T cell priming is a component, 
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with that of a normal CD8+ T cell. In one embodiment, the expression level of the genes may be 
determined using microarrays or other methods of RNA quantitation, or by comparing the gene 
expression profile of an CD8+ T cell treated with a candidate therapeutic with the gene 
expression profile of a normal CD8+ T cell. 
5 The present invention further provides methods of treating disorders of CD8+ T cell 

priming or diseases in which CD8-f- T cell priming is a component using pharmaceutical 
compositions comprised of therapeutic agents identified using the screening methods provided 
by the invention. The present invention contemplates the use of pharmaceutical compositions 
comprised of the therapeutics in a method to treat disorders of CD8+ T cell priming or diseases 
10 in which CD8+ T cell priming is a component. Such methods may include administering to a 
subject having an disorder a pharmaceutically effective amount of an agonist or antagonist of one 
or more genes or their encoded gene products involved in CD8+ T cell priming. Kits comprising 
the pharmaceutical compositions of the present invention are also within the scope of the 
invention. 

15 In one embodiment, the invention provides diagnostic methods for monitoring the 

existence and/or evolution of disorders of CD8+ T cell priming or diseases in which CD8+ T cell 
priming is a component in a subject. For example, the invention provides methods for predicting 
whether a subject is likely to develop such a disorder or disease; methods for confirming that a 
subject, who has been diagnosed as having such a disorder or disease with traditional methods, 

20 has a certain disorder or disease, and not, e.g., a disease that is phenotypically related to it; and 
methods for monitoring the progression of the disease, e.g., in a subject imdergoing treatment. 
Preferred methods comprise evaluating the level of expression of one or more genes that are 
differentially expressed during CD8+ T cell priming in the CD8+ T cells of a subject with or 
suspected of having a disorder of CD8+ T cell priming or disease in which CD8+ T cell priming 

25 is a component. Other methods comprise evaluating the level of expression of tens, himdreds or 
thousands of genes that are differentially expressed during CD8+ T cell priming, e.g., by using 
microarray technology. The expression levels of the genes are then compared to the expression 
levels of the same genes one or more other cells, e.g., a normal cell, or a diseased cell. 
Comparison of the expression levels maybe performed visually. In a preferred 

30 embodiment, the comparison is performed by a computer. In one embodiment, expression levels 
of genes that are differentially expressed during CD8+ T cell priming in cells of subjects having 
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a disorder of CD8+ T cell priming, or disease in which CD8+ T cell priming is a component are 
stored in a computer. The computer may optionally comprise expression levels of these genes in 
normal cells. The data representing expression levels of the genes in a patient being diagnosed 
are then entered into the computer, and compared with one or more of the expression levels 
5 stored in the computer. The computer calculates differences and presents data showing the 
differences in expression of the genes in the two types of cells. 

In one embodiment, a cell sample from a patient is obtained from a caregiver, the level of 
expression of one or more genes that are differentially expressed during CD8+ T cell priming is 
determined, and the expression data are entered into a computer comprising a plurality of 

10 reference expression data associated with particular therapies and compared thereto, to determine 
the most q)propriate therapy for the patient. The method may further comprise sending, e.g., to 
the caregiver, the identity of the appropriate therapy. The data and identity of the appropriate 
therapy may be sent via a network, e.g., the internet. 

In other embodiments of the diagnostic methods contemplated by the present invention, 

15 the method of diagnosis comprises the steps of evaluating the activity of a protein encoded by a 
gene selected from the panels of the invention in the CD8H- T cells of a subject, and comparing 
the activity of said protein in said subject's cells with that in a normal CD8+ T cell of the same 
type. In certain embodiments, a particular type of disorder or disease may be diagnosed if the 
protein whose activity is determined is associated with a particular type of disease or disorder. 

20 The invention also provides compositions comprising one or more detection agents for 

detecting the expression of genes that are differentially expressed during CD8+ T cell priming, 
e.g., for use in diagnostic assays. These agents, which may be, e.g., nucleic acids or 
polypeptides, maybe in solution or bound to a solid surface, such as in the form of a microarray. 
Other embodiments of the invention include databases, computer readable media, computers 

25 containing the gene expression profile[s] of the invention or the level of expression of one or 
more genes that are differentially expressed during CD8+ T cell priming in a CD8+ T cell of a 
subject who may have a disorder of CD8+ T cell priming or a disease in which CD8+ T cell 
priming is a component. 

The present invention further provides a kit comprising a library of gene expression 

30 patterns and reagents for evaluating gene expression levels. For example, the expression level 
may be determined by providing a kit containing appropriate reagents and an appropriate 
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microarray for evaluating the level of expression in the CD8+ T cells of a subject. In other 
embodiments, the invention provides a kit including compositions of the present invention, and 
optionally instructions for their use. Such kits may have a variety of uses, including, for 
example, imaging, diagnosis, therapy, and other applications. 
5 These embodiments of the present invention, other embodiments, and their features and 

characteristics will be apparent from the description, drawings, and claims that follow. 

BRffiF DESCRIPTION OF THE DRAWINGS 

FIGURE 1 depicts genes that exhibit at least 2-fold differential expression during CD8+ 
10 T cell priming. 

FIGURE 2 depicts genes from FIGURE 1 having a known function. 

FIGURE 3 depicts genes from FIGURE 1 whose differential expression was validated by 
RT-PCR, classified by their biological function and whether or not they were modulated during 
2C CD8+ T cell priming. 

15 FIGURE 4 depicts models for A) CD4 Thl/Th2 T cell differentiation and B) CD8+ CTL 

T cell differentiation. 

FIGURE 5 depicts a schematic of the process by which naive and primed CD8+ T cells 
were prepared from 2C/RAG2-/- mice. 

FIGURE 6 depicts the differential expression of putative anti-proliferative genes in naive 
20 and primed 2C CD8+ T cells as detected by RT-PCR. 

FIGURE 7 depicts the diflferential expression of selected cytoskeleton-related proteins in 
naive and primed 2C CD8+ T cells as detected by (A) microarray analysis of mRNA expression 
and (B) reverse transcriptase-PCR analysis of gene expression. 

FIGURE 8 depicts the differential expression of selected adapter proteins, signaling 
25 molecules, and transcription factors in naive and primed 2C CD8+ T cells as detected by (A) 
microarray analysis of mRNA expression and (B) reverse transcriptase-PCR analysis of gene 
expression. 

FIGURE 9 depicts the differential expression of selected effector function-, metabolism-, 
and cytoskeleton-related genes in naive and primed 2C CD8+ T cells as detected by (A) 
30 microarray analysis of mRNA expression and (B) reverse transcriptase-PCR analysis of gene 
expression. 
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DETAILED DESCRIPTION OF THE INVENTION 

1. General 

Peripheral CD8+ T cells circulate in a naive state until they are primed by specific 
5 antigen and differentiate into effector cells. One mechanism by which these two states are 

regulated may be differential gene expression. A detailed understanding of the molecular events 
and changes in gene expression that occur during the transformation of a naive cytotoxic T cell 
into a CTL via differentiation may provide a better basis on which to develop therapeutics and 
diagnostics for diseases with a cell-mediated inunune component, as well as for disorders of the 
10 cell-mediated immune system. Toward this end, we investigated the differential expression of 
specific genes between naive and effector CD8+T cells. 

Basal gene expression profiles of naive and effector 2C TCR transgenic x RAG2"^" CD8+ 
T cells were analyzed using Affymetrix arrays representing 13,000 genes. Approximately 2,000 
genes were observed to exhibit at least 2-fold differential expression during CDSh- T cell 
15 priming, and comprise the genes of FIGURE 1 . Semi-quantitative RT-PCR was used as a 
secondary validation of selected transcripts. This application relates to the genes and/or the 
encoded gene products identified as being differentially expressed during CD8+ T cell priming. 

2. Definitions 

20 For convenience, before fiirther description of the present invention, certain terms 

employed in the specification, examples and appended claims are defined here. 

The singular forms "a", "an", and "the" include plural references unless the context 
clearly dictates otherwise. 

An "address" on an array, e.g., a microarray, refers to a location at which an element, 

25 e.g., an oligonucleotide, is attached to the solid surface of the array. As used herein, a nucleic 

acid or other molecule attached to an array, is referred to as a "probe" or "capture probe." When 
an array contains several probes corresponding to one gene, these probes are referred to as 
"gene-probe set," A gene-probe set may consist of, e.g., 2 to 10 probes, preferably fi-om 2 to 5 
probes and most preferably about 5 probes. 

30 "Agonist" refers to an agent that mimics or up-regulates (e.g., potentiates or supplements) 

the bioactivity of a protein, e.g., polypeptide X. An agonist may be a wild-type protein or 
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derivative thereof having at least one bioactivity of the wild-type protein. An agonist may also 
be a compound that upregulates expression of a gene or which increases at least one bioactivity 
of a protein. An agonist may also be a compound which increases the interaction of a 
polypeptide with another molecule, e.g., a target peptide or nucleic acid. 
5 "Allele", which is used interchangeably herein with "allelic variant", refers to alternative 

forms of a gene or portions thereof Alleles occupy the same locus or position on homologous 
chromosomes. When a subject has two identical alleles of a gene, the subject is said to be 
homozygous for the gene or allele. When a subject has two different alleles of a gene, the 
subject is said to be heterozygous for the gene. Alleles of a specific gene may differ fi:om each 
10 other in a single nucleotide, or several nucleotides, and may include substitutions, deletions, and 
insertions of nucleotides. An allele of a gene may also be a form of a gene containing a 
mutation. 

"Amplification," refers to the production of additional copies of a nucleic acid sequence. 

Amplification is generally carried out using polymerase chain reaction (PGR) technologies well 
15 known in the art (Dieffenbach, C. W. and G. S. Dveksler (1995) PGR Primer, a Laboratory 

Manual, Gold Spring Harbor Press, Plainview, N.Y.). However, those of skill in the art are 

aware of and may choose to use non-PGR amplification methods. 

"Antagonist" refers to an agent that downregulates (e.g., suppresses or inhibits) at least 

one bioactivity of a protein. An antagonist may be a compound which inhibits or decreases the 
20 interaction between a protein and another molecule, e.g., a target peptide or enzyme substrate. 

An antagonist may also be a compound that downregulates expression of a gene or which 

reduces the amount of expressed protein present. 

"Antibody is intended to include whole antibodies, e.g., of any isotype (IgG, IgA, IgM, 

IgE, etc.), and includes fragments thereof which are also specifically reactive with a molecule, 
25 e.g. polypeptide, nucleic acid, small molecule, and the like. In certain embodiments, an antibody 

may be specifically reactive with a vertebrate, e.g., mammalian, polypeptide or nucleic acid. 

Antibodies may be fragmented using conventional techniques and the fragments screened for 

utiUty in the same manner as described above for whole antibodies. Thus, the term includes 

segments of proteolytically-cleaved or recombinantly-prepared portions of an antibody molecule 
30 that are capable of selectively reacting with a certain molecule. Non-limiting examples of such 

proteolytic and/or recombinant fixigments include Fab, F(ab')2, Fab', Fv, and single chain 
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antibodies (scFv) containing a V[L] and/or V[H] domain joined by a peptide linker. The scFv's 
may be covalently or non-covalently linked to form antibodies having two or more binding sites. 
The subject invention includes polyclonal, monoclonal or other purified preparations of 
antibodies and recombinant antibodies. 
5 "Antisense" nucleic acid refers to oligonucleotides which specifically hybridize (e.g., 

bind) under cellular conditions with a gene sequence, such as at the cellular mRNA and/or 
genomic DNA level, so as to inhibit expression of that gene, e.g., by inhibiting transcription 
and/or translation. The binding may be by conventional base pair complementarity, or, for 
example, in the case of binding to DNA duplexes, through specific interactions in the major 
10 groove of the double helix. The term "antisense nucleic acid" also includes RNAs used in RNA 
interference techniques, for example the short interfering RNAs (siRNAs) used in gene silencing 
techniques. 

"Array" or "matrix" refer to an arrangement of addressable locations or "addresses" on a 
device. The locations may be arranged in two dimensional arrays, three dimensional arrays, or 

1 5 other matrix formats. The number of locations may range from several to at least hundreds of 
thousands. Most importantly, each location represents a totally independent reaction site. A 
"nucleic acid array" refers to an array containing nucleic acid probes, such as oligonucleotides or 
larger portions of genes. The nucleic acid on the array is preferably single stranded. Arrays 
wherein the probes are oligonucleotides are referred to as "oligonucelotide arrays" or 

20 "oUgonucleotide chips" or "gene chips". A "microarray", also referred to as a "chip", "biochip", 
or "biological chip", is an array of regions having a density of discrete regions of at least 
100/cm^, and preferably at least about 1000/cm^. The regions in a microarray have typical 
dimensions, e.g. diameters, in the range of between about 10-250 microns, and are separated 
firom other regions in the array by the same distance. 

25 "Biological activity" or "bioactivity" or "activity*' or "biological function", which are 

used interchangeably, refer to an effector or antigenic function that is directly or indirectly 
performed by a polypeptide (whether in its native or denatured conformation), or by any 
subsequence thereof Biological activities include binding to polypeptides, binding to other 
proteins or molecules, activity as a DNA binding protein, as a transcription regulator, ability to 

30 bind damaged DNA, enzymatic activity, etc. A bioactivity may be modulated by directly 
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. affecting the subject polypeptide. Alternatively, a bioactivity may be altered by modulating the 
level of the polypeptide, such as by modulating expression of the corresponding gene. 

"Biological sample" or "sample", refers to a sample obtained from an organism or from 
components (e.g., cells) of an organism. The sample may be of any biological tissue or fluid. 
5 Frequently the sample will be a "clinical sample" which is a sample derived from a patient. Such 
samples include, but are not limited to, sputum, blood, blood cells (e.g., white cells), tissue or 
fine needle biopsy samples, urine, peritoneal fluid, and pleural fluid, or cells therefrom. 
Biological samples may also include sections of tissues such as frozen sections taken for 
histological purposes. 

10 "Biomarker" refers to a biological molecule whose presence, concentration, activity, or 

phosphorylation state may be detected and correlated with the activity of a protein of interest. 

"Cell cycle" refers to a repeating sequence of events in eukaryotic cells consisting of two 
periods: first, a cell-growth period comprising the first gap or growth phase (Gl), the DNA 
synthesis phase (S), and the second gap or growth phase (G2); and second, a cell-division period 

15 comprising mitosis (M). 

"CD8+ T cell" refers to any T cell expressing the CDS biomarker on its surface. The 
term "CD8+ T cell" encompasses all stages of development and activation of such cells, for 
example, naive CD8+ T cells, primed or activated CD8+ T cells, and any progenitors of such 
cells. 

20 "CD8+ T cell priming" refers to the process by which a naive CD8H- T cell undergoes 

clonal expansion and differentiation into an effector cytotoxic T cell (CTL), or "primed CD8+ T 
cell", after it is stimulated by at least one signal. 

"Combinatorial library" or "library" is a plurality of compounds, which may be termed 
"members," synthesized or otherwise prepared from one or more starting materials by employing 

25 either the same or different reactants or reaction conditions at each reaction in the library. In 

general, the members of any Ubrary show at least some structural diversity, which often results in 
chemical diversity. A library may have anywhere from two different members to about 10^ 
members or more. In certain embodiments, libraries of the present invention have more than 
about 12, 50 and 90 members. In certain embodiments of the present invention, the starting 

30 materials and certain of the reactants are the same, and chemical diversity in such libraries is 

achieved by varying at least one of the reactants or reaction conditions during the preparation of 
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the library. Combinatorial libraries of the present invention may be prepared in solution or on 
the solid phase. 

"Complementary" or "complementarity", refer to the natural binding of polynucleotides 
under permissive salt and temperature conditions by base-pairing. For example, the sequence " A- 

5 G-T" binds to the complementary sequence "T-C-A". Complementarity between two single- 
stranded molecules may be "partial", in which only some of the nucleic acids bind, or it may be 
complete when total complementarity exists between the single stranded molecules. The degree 
of complementarity between nucleic acid strands has significant effects on the efficiency and 
strength of hybridization between nucleic acid strands. 

10 "Cytokine*' refers to soluble biochemicals produced by cells that mediate reactions 

between cells, usually used for biological response modifiers. 

"Delivery complex" refers to a targeting means (e.g. a molecule that results in higher 
affinity binding of a gene, protein, polypeptide or peptide to a target cell surface and/or increased 
cellular or nuclear uptake by a target cell). Examples of targeting means include: sterols (e.g. 

IS cholesterol), lipids (e.g. a cationic lipid, virosome or liposome), viruses (e.g. adenovirus, adeno- 
associated virus, and retrovirus) or target cell specific binding agents (e.g. ligands recognized by 
target cell specific receptors). Preferred complexes are sufficiently stable in vivo to prevent 
significant uncoupling prior to internalization by the target cell. However, the complex is 
cleavable imder appropriate conditions within the cell so that the gene, protein, polypeptide or 

20 peptide is released in a fiinctional form. 

"Derived firom" as that phrase is used herein indicates a peptide or nucleotide sequence 
selected fix)m within a given sequence. A peptide or nucleotide sequence derived firom a named 
sequence may contain a small number of modifications relative to the parent sequence, in most 
cases representing deletion, replacement or insertion of less than about 15%, preferably less than 

25 about 10%, and in many cases less than about 5%, of amino acid residues or base pairs present 
in the parent sequence. In the case of DNAs, one DNA molecule is also considered to be derived 
firom another if the two are capable of selectively hybridizing to one another. 

"Derivative" refers to the chemical modification of a polypeptide sequence, or a 
polynucleotide sequence. Chemical modifications of a polynucleotide sequence may include, for 

30 example, replacement of hydrogen by an alkyl, acyl, or amino group. A derivative 

polynucleotide encodes a polypeptide which retains at least one biological or immunological 
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function of the natural molecule. A derivative polypeptide is one modified by glycosylation, 
pegylation, or any similar process that retains at least one biological or immunological function 
of the polypeptide from which it was derived. 

"Detection agents of genes" refer to agents that may be used to specifically detect the 
5 gene or other biological molecule relating to it, e.g., RNA transcribed from the gene and 

polypeptides encoded by the gene. Exemplary detection agents are nucleic acid probes which 
hybridize to nucleic acids corresponding to the gene and antibodies. 

"Differentiation" refers to the process by which a cell becomes specialized for a specific 
stracture or function by selective gene expression of some genes and selective repression of 
10 others. 

"Differential expression" refers to both quantitative as well as quahtative differences in a 
gene's temporal and/or tissue expression patterns. Differentially expressed genes may represent 
"target genes." 

"Differential gene expression pattern" between cell A and cell B refers to a pattem 

15 reflecting the differences in gene expression between cell A and cell B. A differential gene 
expression pattem may also be obtained between a cell at one time pomt and a cell at another 
time point, or between a cell incubated or contacted with a compound and a cell that was not 
incubated with or contacted with the compound. 

"Disorder of CD8+ T cell priming, or disease in which CD8+ T cell priming is a 

20 component" refers to any disease or disorder in which CD8+ T cells are affected (e.g. are 

diseased) or in which the CD8+ T cell response plays a central role. Examples of disorders in 
which the CD8+ T cell response plays a central role include various cancers of CD8+ T cells, 
such as T cell lymphoma, and immunodeficiency disorders, such as common variable 
immunodeficiency. Examples of diseases or disorders in which the CD8+ T cell response plays 

25 a central role include viral infections, parasitic infections, graft rejection, autoimmunity and 
cancer. "Expression of genes characteristic of a disorder of CD8+ T cell priming, or disease in 
which CD8+ T cell priming is a component" refers to those genes which exhibit a certain level of 
expression when a subject has a given disorder of CD8+ T cell priming, or disease in which 
CD8-f- T cell priming is a component. A "diseased subject" refers to a subject having a disorder 

30 of CD8+ T cell priming, or disease in which CD8+ T cell priming is a component. 
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"Equivalent" refers to nucleotide sequences encoding functionally equivalent 
polypeptides. Equivalent nucleotide sequences will include sequences that differ by one or more 
nucleotide substitutions, additions or deletions, such as allelic variants; and will, therefore, 
include sequences that differ from the nucleotide sequence of the nucleic acids referred to in 
5 FIGURES 1-3 due to the degeneracy of the genetic code. 

"Expression profile," which is used interchangeably herein with "gene expression 
profile" and "finger print" of a cell, refers to a set of values representing mRNA levels of 20 or 
more genes in a cell. An expression profile preferably comprises values representing expression 
levels of at least about 30 genes, preferably at least about 50, 100, 200 or more genes. 
1 0 Expression profiles preferably comprise an mRNA level of a gene which is expressed at similar 
levels in multiple cells and conditions, e.g., GAPDH. For example, an expression profile of a 
diseased cell of disease D refers to a set of values representing mRNA levels of 20 or more genes 
in a diseased cell. 

The "level of expression of a gene in a cell" or "gene expression level" refers to the level 
1 5 of mRNA, as well as pre-mRNA nascent transcript(s), transcript processing intermediates, 
mature mRNA(s) and degradation products, encoded by the gene in the cell. 

"Gene" or "recombinant gene" refer to a nucleic acid molecule comprising an open 
reading frame and including at least one exon and (optionally) an intron sequence. "Intron" 
refers to a DNA sequence present in a given gene which is spliced out during mRNA maturation. 
20 "Gene construct" refers to a vector, plasmid, viral genome or the like which includes a 

"coding sequence" for a polypeptide or which is otherwise transcribable to a biologically active 
RNA (e.g., antisense, decoy, ribozyme, etc), may transfect cells, in certain embodiments 
mammalian cells, and may cause expression of the coding sequence in cells transfected with the 
construct. The gene construct may include one or more regulatory elements operably linked to 
25 the coding sequence, as well as intronic sequences, poly adenylation sites, origins of replication, 
marker genes, etc. 

"Heterozygote," refers to an individual with different alleles at corresponding loci on 
homologous chromosomes. Accordingly, "heterozygous" describes an individual or strain 
having different allelic genes at one or more paired loci on homologous chromosomes. 
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"Homozygote," refers to an individual with the same allele at corresponding loci on 
homologous chromosomes. Accordingly, "homozygous", describes an individual or a strain 
having identical allelic genes at one or more paired loci on homologous chromosomes. 
"Homology** or altematively "identity" refers to sequence similarity between two 
5 peptides or between two nucleic acid molecules. Homology may be determined by comparing a 
position in each sequence which may be aligned for purposes of comparison. When a position in 
the compared sequence is occupied by the same base or amino acid, then the molecules are 
homologous at that position. A degree of homology between sequences is a function of the 
number of matching or homologous positions shared by the sequences. The term "percent 

10 identical" refers to sequence identity between two amino acid sequences or between two 
nucleotide sequences. Identity may each be determined by comparing a position in each 
sequence which may be aligned for purposes of comparison. When an equivalent position in the 
compared sequences is occupied by the same base or amino acid, then the molecules are identical 
at that position; when the equivalent site occupied by the same or a similar amino acid residue 

15 (e.g., similar in steric and/or electronic nature), then the molecules may be referred to as 

homologous (similar) at that position. Expression as a percentage of homology, similarity, or 
identity refers to a function of the nimiber of identical or similar amino acids at positions shared 
by the compared sequences. Various alignment algorithms and/or programs may be used, 
including FASTA, BLAST, or ENTREZ, FASTA and BLAST are available as a part of the GCG 

20 sequence analysis package (University of Wisconsin, Madison, Wis.), and may be used with, 
e.g., default settings. ENTREZ is available through the National Center for Biotechnology 
hiformation, National Library of Medicine, National histitutes of Health, Bethesda, Md. In one 
embodiment, the percent identity of two sequences may be determined by the GCG program 
with a gap weight of 1, e.g., each amino acid gap is weighted as if it were a single amino acid or 

25 nucleotide mismatch between the two sequences. 

Other techniques for alignment are described in Methods in Enzymology, vol. 266: 
Computer Methods for Macromolecular Sequence Analysis (1996), ed. Doolittle, Academic 
Press, Inc., a division of Harcourt Brace & Co., San Diego, California, USA. Preferably, an 
alignment program that permits gaps in the sequence is utilized to align the sequences. The 

30 Smith- Waterman is one type of algorithm that permits gaps in sequence alignments. See Meth. 
Mol Biol. 70: 173-187 (1997). Also, the GAP program using the Needleman and Wunsch 
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alignment method may be utilized to align sequences. An alternative search strategy uses 
MPSRCH software, which runs on a MASPAR computer. MPSRCH uses a Smith- Waterman 
algorithm to score sequences on a massively parallel computer. This approach improves ability 
to pick up distantly related matches, and is especially tolerant of small gaps and nucleotide 
5 sequence errors. Nucleic acid-encoded amino acid sequences may be used to search both protein 
and DNA databases. 

Databases with individual sequences are described in Methods in Enzymology, ed. 
Doolittle, supra. Databases include Genbank, EMBL, and DNA Database of Japan (DDBJ). 

"Hormone" refers to any one of a number of biochemical substances that are produced by 

10 a certain cell or tissue and that cause a specific biological change or activity to occur in another 
cell or tissue located elsewhere in the body. 

"Host cell" refers to a cell transduced with a specified gene construct. The cell is 
optionally selected firom in vitro cells such as those derived fi-om cell culture, ex vivo cells, such 
as those derived firom an organism, and in vivo cells, such as those in an organism. 

1 5 "Recombinant host cells" refers to cells which have been transformed or transfected with vectors 
constructed using recombinant DNA techniques. "Host cells" or "recombinant host cells" are 
terms used interchangeably herein. It is understood that such terms refer not only to the 
particular subject cell but to the progeny or potential progeny of such a cell. Because certain 
modifications may occur in succeeding generations due to either mutation or environmental 

20 influences, such progeny may not, in fact, be identical to the parent cell, but are still included 
within the scope of the term as used herein. 

"Hybridization" refers to any process by which a strand of nucleic acid binds with a 
complementary strand through base pairing. 

"Specific hybridization" of a probe to a target site of a template nucleic acid refers to 

25 hybridization of the probe predominantly to the target, such that the hybridization signal may be 
clearly interpreted. As fiirther described herein, such conditions resulting in specific 
hybridization vary depending on the length of the region of homology, the GC content of the 
region, the melting temperature "Tm" of the hybrid. Hybridization conditions will thus vary in 
the salt content, acidity, and temperature of the hybridization solution and the washes. 

30 "Interact" is meant to include detectable interactions between molecules, such as may be 

detected using, for example, a hybridization assay. Interact also includes "binding" interactions 
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between molecules. Interactions may be, for example, protein-protein, protein-nucleic acid, 
protein*small molecule or small molecule-nucleic acid in nature. 

"Isolated", with respect to nucleic acids, such as DNA or RNA, refers to molecules 
separated from other DNAs, or RNAs, respectively, that are present in the natural source of the 

5 macromoiecule. Isolated also refers to a nucleic acid or peptide that is substantially free of 
cellular material, viral material, or culture medium when produced by recombinant DNA 
techniques, or chemical precursors or other chemicals when chemically synthesized. Moreover, 
an "isolated nucleic acid" is meant to include nucleic acid fragments which are not naturally 
occurring as fragments and would not be found in the natural state. "Isolated" also refers to 

10 polypeptides which are isolated from other cellular proteins and is meant to encompass both 
purified and recombinant polypeptides. 

"Label" and "detectable label" refer to a molecule capable of detection, including, but not 
limited to, radioactive isotopes, fluorophores, chemiluminescent moieties, enzymes, enzyme 
substrates, enzyme cofactors, enzyme inhibitors, dyes, metal ions, ligands (e.g., biotin or 

15 haptens) and the like. "Fluorophore" refers to a substance or a portion thereof which is capable 
of exhibiting fluorescence in the detectable range. Particular examples of labels which may be 
used under the invention include fluorescein, rhodamine, dansyl, umbelliferone, Texas red, 
limiinol, NADPH, alpha - beta -galactosidase and horseradish peroxidase. 

A "molecular target" or "target" refers to a molecular structure that is a gene or derived 

20 from a gene that has been identified using the methods of the invention as exhibiting differential 
expression between naive CD8H- T cells and primed CD8+ T cells. Exemplary targets as such 
are polypeptides, hormones, receptors, dsDNA fragments, carbohydrates or enzymes. Such 
targets also may be referred to as "target genes", "target peptides", "target proteins", and the like. 
"Modulation" refers to up regulation (i.e., activation or stimulation), down regulation 

25 (i.e., inhibition or suppression) of a response, or the two in combination or apart. 

"Normalizing expression of a gene" in a cell of a diseased subject refers to a means for 
compensating for the altered expression of the gene in the cell, so that it is essentially expressed 
at the same level as in the corresponding cell of a subject without the disease or disorder. For 
example, where the gene is over-expressed in the cell of a diseased subject, normalization of its 

30 expression in the cell refers to treating the cell in such a way that its expression becomes 

essentially the same as the expression in the counterpart cell of a normal, nondiseased subject. 
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"Normalization** preferably brings the level of expression to within approximately a 50% 
difference in expression, more preferably to within approximately a 25%, and even more 
preferably 10% difference in expression. The required level of closeness in expression will 
depend on the particular gene, and may be determined as described herein. 
5 *T^Iormalizing gene expression in a CD8-f T cell of a patient having a disorder of CD8+ T 

ceil priming or disease in which CD8-I- T cell priming is a component" refers to a means for 
normalizing the expression of essentially all genes in the cell. 

"Nucleic acid" refers to polynucleotides such as deoxyribonucleic acid (DNA), and, 
where appropriate, ribonucleic acid (RNA). The term should also be understood to include, as 
1 0 equivalents, analogs of either RNA or DNA made from nucleotide analogs, and, as appUcable to 
the embodiment being described, single (sense or antisense) and double-stranded 
polynucleotides. ESTs, chromosomes, cDNAs, mRNAs, and rRNAs are representative examples 
of molecules that may be referred to as nucleic acids. 

"Nucleic acid corresponding to a gene" refers to a nucleic acid that may be used for 
15 detecting the gene, e.g., a nucleic acid which is capable of hybridizing specifically to the gene. 

^'Nucleic acid sample derived from RNA" refers to one or more nucleic acid molecule, 
e.g., RNA or DNA, that was synthesized from the RNA, and includes DNA resulting from 
methods using PGR, e.g., RT-PCR. 

"Panel" as used herein refers to a group of genes and/or their encoded proteins identified 
20 via a gene expression profile as being differentially expressed during CD8+ T cell priming. 

"Parenteral administration*' and "administered parenterally" means modes of 
adnndnistration other than enteral and topical administration, usually by injection, and includes, 
without limitation, intravenous, intramuscular, intraarterial, intrathecal, intracapsular, 
intraorbital, intracardiac, intradermal, intraperitoneal, transtracheal, subcutaneous, subcuticular, 
25 intra-articular, subcapsular, subarachnoid, intraspinal and intrastemal injection and infusion. 

A "patient", "subject" or "host" to be treated by the subject method may mean either a 
human or non-human animal. 

"Peptidomimetic" refers to a compoxmd containing peptide-like structural elements that is 
capable of mimicking the biological action (s) of a natural parent polypeptide. 
30 "Percent identical*' refers to sequence identity between two amino acid sequences or 

between two nucleotide sequences. Identity may each be determined by comparing a position in 
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each sequence which may be aligned for purposes of comparison. When an equivalent position 
in the compared sequences is occupied by the same base or amino acid, then the molecules are 
identical at that position; when the equivalent site occupied by the same or a similar amino acid 
residue (e.g., similar in steric and/or electronic nature), then the molecules may be referred to as 
5 homologous (similar) at that position. Expression as a percentage of homology, similarity, or 
identity refers to a function of the number of identical or similar amino acids at positions shared 
by the compared sequences. Various alignment algorithms and/or programs may be used, 
including FASTA, BLAST, or ENTREZ. FASTA and BLAST are available as a part of the GCG 
sequence analysis package (University of Wisconsin, Madison, Wis.), and may be used with, 

10 e.g., default settings. ENTREZ is available through the National Center for Biotechnology 

Information, National Library of Medicine, National Listitutes of Health, Bethesda, Md. In one 
embodiment, the percent identity of two sequences may be determined by the GCG program 
with a gap weight of 1, e.g., each amino acid gap is weighted as if it were a single amino acid or 
nucleotide mismatch between the two sequences. Other techniques for alignment are described 

15 in Methods in Enzymology, vol. 266: Computer Methods for Macromolecular Sequence Analysis 
(1996), ed. Doolittle, Academic Press, Inc., a division of Harcourt Brace & Co., San Diego, 
California, USA. Preferably, an aUgnment program that permits gaps in the sequence is utilized 
to align the sequences. The Smith- Waterman is one type of algorithm that permits gaps in 
sequence alignments. SeeMeth. Mol Biol 70: 173-187 (1997). Also, the GAP program using 

20 the Needleman and Wunsch alignment method may be utilized to align sequences. An 
alternative search strategy uses MPSRCH software, which runs on a MASPAR computer. 
MPSRCH uses a Smith- Waterman algorithm to score sequences on a massively parallel 
computer. This approach improves ability to pick up distantly related matches, and is especially 
tolerant of small gaps and nucleotide sequence errors. Nucleic acid-encoded amino acid 

25 sequences may be used to search both protein and DNA databases. Databases with individual 
sequences are described in Methods in Enzymology, ed. Doolittle, supra. Databases include 
Genbank, EMBL, and DNA Database of Japan (DDBJ). 

"Perfectly matched" in reference to a duplex means that the poly- or oligonucleotide 
strands making up the duplex form a double stranded structure with one other such that every 

30 nucleotide in each strand undergoes Watson-Crick basepairing with a nucleotide in the other 
strand. The term also comprehends the pairing of nucleoside analogs, such as deoxyinosine. 
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nucleosides with 2-aminopurine bases, and the like, that may be employed. A mismatch in a 
duplex between a target polynucleotide and an oligonucleotide or olynucleotide means that a pair 
of nucleotides in the duplex fails to undergo Watson-Crick bonding. In reference to a triplex, the 
term means that the triplex consists of a perfectly matched duplex and a third strand in which 
5 every nucleotide undergoes Hoogsteen or reverse Hoogsteen association with a basepair of the 
perfectly matched duplex. 

"Pharmaceutically-acceptable salts'* refers to inorganic and organic acid addition salts of 
compounds that are sufficiently non-toxic. 

"Pharmaceutically acceptable carrier" refers to a pharmaceutically-acceptable material, 

10 composition or vehicle, such as a liquid or solid filler, diluent, excipient, solvent or encapsulating 
material, involved in carrying or transporting any supplement or composition, or component 
thereof, fi-om one organ, or portion of the body, to another organ, or portion of the body. Each 
carrier must be "acceptable" in the sense of being compatible with the other ingredients of the 
supplement and not injurious to the patient. Some examples of materials which may serve as 

15 pharmaceutically acceptable carriers include: (1) sugars, such as lactose, glucose and sucrose; (2) 
starches, such as com starch and potato starch; (3) cellulose, and its derivatives, such as sodium 
carboxymethyl cellulose, ethyl cellulose and cellulose acetate; (4) powdered tragacanth; (5) malt; 
(6) gelatin; (7) talc; (8) excipients, such as cocoa butter and suppository waxes; (9) oils, such as 
peanut oil, cottonseed oil, safflower oil, sesame oil, olive oil, com oil and soybean oil; (10) 

20 glycols, such as propylene glycol; (11) polyols, such as glycerin, sorbitol, mannitol and 

polyethylene glycol; (12) esters, such as ethyl oleate and ethyl laurate; (13) agar; (14) buffering 
agents, such as magnesiiun hydroxide and aluminum hydroxide; (15) alginic acid; (16) pyrogen- 
firee water; (17) isotonic saline; (18) Ringer's solution; (19) ethyl alcohol; (20) phosphate buffer 
solutions; and (21) other non-toxic compatible substances employed in pharmaceutical 

25 formulations. 

The "profile" of a cell's biological state refers to the levels of various constituents of a 
cell that are known to change in response to drug treatments and other perturbations of the cell's 
biological state. Constituents of a cell include levels of RNA, levels of protein abxmdances, or 
protein activity levels. 

30 An expression profile in one cell is "similar" to an expression profile in another cell when 

the level of expression of the genes in the two profiles are sufficiently similar that the similarity 
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is indicative of a common characteristic, e.g., being one and the same type of cell. Accordingly, 
the expression profiles of a first cell and a second cell are similar when at least 75% of the genes 
that are expressed in the first cell are expressed in the second cell at a level that is within a factor 
of two relative to the first cell. 
5 "Proliferating" and "proUferation" refer to cells undergoing mitosis. 

"Prophylactic" or "therapeutic" treatment refers to administration to the host of one or 
more of the subject compositions. If it is administered prior to clinical manifestation of the 
unwanted condition (e.g., disease or other unwanted state of the host animal) then the treatment 
is prophylactic, i.e., it protects the host against developing the unwanted condition, whereas if 
10 administered after manifestation of the imwanted condition, the treatment is therapeutic (i.e., it is 
intended to diminish, ameliorate or maintain the existing unwanted condition or side effects 
therefi-om). 

"Protein", "polypeptide" and "peptide" are used interchangeably herein when referring to 
a gene product, e.g., as may be encoded by a coding sequence. By "gene product" it is meant a 
1 5 molecule that is produced as a result of transcription of a gene. Gene products include RNA 
molecules transcribed firom a gene, as well as proteins translated firom such transcripts. 

"Recombinant protein", "heterologous protein" and "exogenous protein" are used 
interchangeably to refer to a polypeptide which is produced by recombinant DNA techniques, 
wherein generally, DNA encoding the polypeptide is inserted into a suitable expression vector 
20 which is in turn used to transform a host cell to produce the heterologous protein. That is, the 
polypeptide is expressed fi*om a heterologous nucleic acid. 

"Small molecule" refers to a composition, which has a molecular weight of less than 
about 1000 kDa . Small molecules may be nucleic acids, peptides, polypeptides, 
peptidomimetics, carbohydrates, lipids or other organic (carbon-containing) or inorganic 
25 molecules. As those skilled in the art will appreciate, based on the present description, libraries 
of chemical and/or biological extensive libraries of chemical and/or biological mixtures, often 
fimgal, bacterial, or algal extracts, may be screened with any of the assays of the invention to 
identify compoxmds that modulate a bioactivity. 

"Surrogate" refers a biological molecule, e.g., a nucleic acid, peptide, hormone, etc., 
30 whose presence or concentration may be detected and correlated with a known condition, such as 
a disease state. 
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"Systemic administration," "administered systemically," "peripheral administration" and 
"administered peripherally*' refer to the administration of a subject supplement, composition, 
therapeutic or other material other than directly into the central nervous system, such that it 
enters the patient's system and, thus, is subject to metabolism and other like processes, for 
5 example, subcutaneous administration. 

"Therapeutic agent" or "therapeutic" refers to an agent capable of having a desired 
biological effect on a host. Chemotherapeutic and genotoxic agents are examples of therapeutic 
agents that are generally known to be chemical in origin, as opposed to biological, or cause a 
therapeutic effect by a particular mechanism of action, respectively. Examples of therapeutic 

10 agents of biological origin include growth factors, hormones, and cytokines. A variety of 
therapeutic agents are known in the art and may be identified by their effects. Certain 
therapeutic agents are capable of regulating red cell proliferation and differentiation. Examples 
include chemotherapeutic nucleotides, drugs, hormones, non-specific (non-antibody) proteins, 
oligonucleotides (e.g., antisense oligonucleotides that bind to a target nucleic acid sequence (e.g., 

15 mRNA sequence)), peptides, and peptidomimetics. 

"Therapeutic effect" refers to a local or systemic effect in animals, particularly mammals, 
and more particularly humans caused by a pharmacologically active substance. The term thus 
means the effect caused by any substance intended for use in the diagnosis, cure, mitigation, 
treatment or prevention of disease or in the enhancement of desirable physical or mental 

20 development and conditions in an animal or human. The phrase "therapeutically-effective 
amount" means that amount of such a substance that produces some desired local or systemic 
effect at a reasonable benefit/risk ratio applicable to any treatment. In certain embodiments, a 
therapeutically effective amount of a compound will depend on its therapeutic index, solubility, 
and the like. For example, certain compounds discovered by the methods of the present 

25 invention may be administered in a sufficient amount to produce a at a reasonable benefit/risk 
ratio applicable to such treatment. 

"Treating" a disease in a subject or "treating" a subject having a disease refers to 
subjecting the subject to a pharmaceutical treatment, e.g., the administration of a drug, such that 
at least one symptom of the disease is decreased or prevented. 

30 "Variant," when used in the context of a polynucleotide sequence, may encompass a 

polynucleotide sequence related to that of gene X or the coding sequence thereof This definition 
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may also include, for example, "allelic," "splice," "species," or "polymorphic" variants. A splice 
variant may have significant identity to a reference molecule, but will generally have a greater or 
lesser number of polynucleotides due to altemate splicing of exons during mRNA processing. 
The corresponding polypeptide may possess additional functional domains or an absence of 

5 domains. Species variants are polynucleotide sequences that vary from one species to another. 
The resulting polypeptides generally will have significant amino acid identity relative to each 
other. A polymorphic variant is a variation in the polynucleotide sequence of a particular gene 
between individuals of a given species. Polymorphic variants also may encompass "single 
nucleotide polymorphisms" (SNPs) in which the polynucleotide sequence varies by one base. 

10 The presence of SNPs may be indicative of, for example, a certain population, a disease state, or 
a propensity for a disease state. 

A "variant" of polypeptide X refers to a polypeptide having the amino acid sequence of 
peptide X in which is altered in one or more amino acid residues. The variant may have 
"conservative" changes, wherein a substituted amino acid has similar stmctural or chemical 

1 5 properties (e.g., replacement of leucine with isoleucine). More rarely, a variant may have 
"nonconservative" changes (e.g., replacement of glycine with tryptophan). Analogous minor 
variations may also include amino acid deletions or insertions, or both. Guidance in evaluating 
which amino acid residues may be substituted, inserted, or deleted without abolishing biological 
or immunological activity may be found using computer programs well known in the art, for 

20 example, LASERGENE software (DNASTAR). 

"Vector" refers to a nucleic acid molecule capable of transporting another nucleic acid to 
which it has been linked. One type of preferred vector is an episome, i.e., a nucleic acid capable 
of extra-chromosomal replication. Preferred vectors are those capable of autonomous replication 
and/or expression of nucleic acids to which they are linked. Vectors capable of directing the 

25 expression of genes to which they are operatively linked are referred to herein as "expression 
vectors". In general, expression vectors of utility in recombinant DNA techniques are often in 
the form of "plasmids" which refer generally to circular double stranded DNA loops, which, in 
their vector form are not bound to the chromosome. In the present specification, "plasmid" and 
"vector" are used interchangeably as the plasmid is the most commonly used form of vector. 

30 However, as will be appreciated by those skilled in the art, the invention is intended to include 
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such other forms of expression vectors which serve equivalent functions and which become 
known in the art subsequently hereto. 

3. Novel Panels of Molecular Targets In the CD8+ T Cell Priming Process 
5 The panels of genes exhibiting differential expression during CD8+ T cell priming 

comprise genes involved in a variety of biological processes. In one embodiment, the panels of 
genes may be comprised of at least one of the genes that are differentially expressed during 
CD8+ T cell priming in FIGURE 1 . In other embodiments, subject panels may comprise subsets 
of the genes in FIGURE 1. For example, expression of certain genes increased upon priming, 
10 and thus a panel in certain embodiments may be comprised of at least one gene whose expression 
is upregulated during CD8+ T cell priming. Likewise, in other embodiments, a panel may be 
comprised of at least one gene whose expression is downregulated during CD8+.T cell priming. 
In other embodiments, the subject panels may comprise genes or gene products whose biological 
function is known. 

15 In certain embodiments, such groups of differentially-expressed genes with a related 

biological function may comprise panels of the invention. In one embodiment, a panel may 
comprise genes or gene products with a single related biological function. In another 
embodiment, a panel may comprise genes or gene products with different biological functions. 
In other embodiments, a panel may be comprised of at least one gene selected from each of the 

20 biological function groups. For example, biological function groups of the invention include 
chemokines, chemokine receptors, cytokines or growth factors, cytokine receptors, G protein 
coupled receptors (GPCRs), nuclear hormone receptors, kinases, kinase modulators, 
phosphatases, phosphatase modulators, transcription factors, transcription co-activators, 
transcription co-factors, CD proteins, proteases, protease inhibitors, channels and transporters, 

25 and cytoskeletal proteins. 

In another embodiment, the novel panels of the present invention may be comprised of 
the gene products of the panel genes, for example mRNAs and proteins. The panels comprise 
sets of molecular targets that are contemplated for use in the therapeutic and diagnostic methods 
described below. 
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4. Therapeutics for Disorders of CD8-f- T Cell Priming or Diseases in Which CD8+ T Cell 
Priming Is a Component 

4A, Therapeutic Agent Screening 

The present invention further relates to the use of the novel panels in methods of 
5 screening candidate therapeutic agents for use in treating disorders of CD8+ T cell priming or 
diseases in which CD8+ T cell priming is a component. In one embodiment of the invention, the 
disorder is graft rejection. In another embodiment of the invention, the disorder is cancer. The 
invention provides a method for identifying a candidate ttierapeutic agent for a disorder of CD8+ 
T cell priming or a disease in which CD8+ T cell priming is a component comprising: 
10 (a) contacting a compound with a panel comprising at least one gene or gene product 

selected from FIGURE 1 ; and 

(b) evaluating whether said compound is a candidate therapeutic for a disorder of CD8+ 
T cell priming or a disease in which CD8+ T cell priming is a component; wherein said 
evaluating step is performed by measuring the interaction between said compoimd and 
1 5 said gene or gene product, or by measuring a change in said gene or gene product caused 

by said compound. 

The present invention further provides methods for evaluating candidate therapeutic 
agents for their ability to modulate the expression of a target gene by contacting the CD8+ T 
cells of a subject with said candidate therapeutic agents. In certain embodiments, the candidate 

20 therapeutic will be evaluated for its ability to normalize the level of expression of a gene or 
group of genes involved in CD8+ T cell priming. The candidate therapeutics may be selected 
from the following classes of compounds: antisense nucleic acids, ribozymes, siRNAs, dominant 
negative mutants of polypeptides encoded by the genes, small molecules, polypeptides, proteins, 
peptidomimetics, and nucleic acid analogs. 

25 Alternatively, candidate therapeutic agents may be evaluated for their ability to inhibit 

the activity of a protein by contacting the CD8+ T cells of a subject with said candidate 
therapeutic agents. In certain embodiments, a candidate therapeutic may be evaluated for its 
ability to inhibit the activity of a protein that normally promotes CD8+ T cell priming. In other 
embodiments, a candidate therapeutic may be evaluated for its ability to inhibit the activity of a 

30 protein that normally if inhibited promotes CD8+ T cell priming. 
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Furthermore, a candidate therapeutic may be evaluated for its ability to normalize the 
level of tximover of a protein encoded by a gene from the panels of the present invention. In 
another embodiment, a candidate therapeutic may be evaluated for its ability to normalize the 
translational level of a protein encoded by a gene from the panels of the present invention. In yet 
5 another embodiment, a candidate therapeutic may be evaluated for its ability to normalize the 
level of tumover of an mRNA encoded by a gene from the panels of the present invention. 

The candidate therapeutics may be selected, for example, from the following classes of 
compounds: proteins, peptides, peptidomimetics, small molecules, cj^okines, or hormones. In 
other embodiments, candidate therapeutics are evaluated for their ability to bind a target gene. 

10 The candidate therapeutics may be selected, for example, from the following classes of 

compoxinds: antisense nucleic acids, small molecules, polypeptides, proteins, peptidomimetics, 
or nucleic acid analogs. In some embodiments, the candidate therapeutics may be in a library of 
compounds. These libraries may be generated using combinatorial synthetic methods. In certain 
embodiments of the present invention, the ability of said candidate therapeutics to bind a target 

15 protein may be evaluated by an in vitro assay. In embodiments of the invention where the target 
of the candidate therapeutics is a gene, the ability of the candidate therapeutic to bind the gene 
may be evaluated by an in vitro assay. In either embodiment, the binding assay may also be in 
vivo. 

4,2. Therapeutic Agent Screening Assays 

20 One aspect of the present invention provides methods for screening various compounds 

as candidate therapeutics for a disorder of CD8+ T cell priming or a disease in which CD8+ T 
cell priming is a component. Such methods comprise assays for evaluating the effect of a 
compound when it contacts a gene or gene product involved in such disorders or diseases, a 
group of genes or gene products, or a whole cell or organism that has or is a model for a disorder 

25 of CD8+ T cell priming or a disease in which CD8+ T cell priming is a component. The result of 
the assays comprising the methods indicate whether or not a compound is a therapeutic agent for 
a disorder of CD8+ T cell priming or a disease in which CD8+ T cell priming is a component. A 
result, for example, may be whether or not a compound is able to bind a gene or gene product, 
whether or not a compound is able to modulate the activity of a gene or gene product, or whether 

30 or not a compound is able to reduce a symptom of or slow the progression of a disorder of CD8+ 
T cell priming or a disease in which CD8+ T cell priming is a component in an animal. For 



-26- 



example, a compound that binds to or modulates the activity of a gene or gene product involved 
in a disorder of CD8+ T cell priming or a disease in which CD8+ T cell priming is a component 
may be considered a therapeutic agent. Likewise, a compound that reduces or ameliorates the 
symptoms in an animal with a disorder of CD8+ T cell priming or a disease in which CD8+ T 

5 cell priming is a component by binding or modulating the activity of a gene or gene product may 
be considered a therapeutic agent. 

Assays and methods of developing assays appropriate for use in the methods described 
above are known to those of skill in the art, and are contemplated for use as appropriate with the 
methods of the present invention. The ability of said candidate therapeutics to bind a target 

10 molecule on the panels of the present invention may be evaluated using a variety of appropriate 
assays known to those of skill in the art. Such assays are referred to herein as "binding assays". 
Likewise, the ability of the compound to modulate the activity of a gene or gene product, the 
expression of a gene, the level of turnover of a protein, the translational level of a protein, or the 
level of turnover of an mRNA may be evaluated. In certain embodiments of the present 

1 5 invention, the ability of a candidate therapeutic to bind a target protein or gene may be evaluated 
by an in vitro assay. In either embodiment, the assay may also be an in vivo assay. In other 
embodiments, in vitro or in vivo assays may be conducted to identify molecules that modulate 
the expression and or activity of a gene. Such assays are referred to herein as "gene expression 
assays". Altematively, in vitro or in vivo assays may be conducted to identify molecules that 

20 modulate the activity of a protein encoded by a gene. Such assays are referred to herein as 
"activity assays". 

Those of skill in the art will recognize that in certain screening assays, it will be sufficient 
to assess the level of expression of a single gene and that in others, the expression of two or more 
is preferred, whereas still in others, the expression of essentially all the genes involved in CD8+ 

25 T cell priming is preferably assessed. Likewise, it will be sufficient to assess the activity of a 

single protein in some screening assays, whereas in others, the activities of multiple proteins may 
be assessed. Examples of assays contemplated for use in the present invention include, but are 
not limited to, binding assays such as competitive binding assays, direct binding assay, and two- 
hybrid assay; and expression and activity assays such as cell proliferation assays, kinase assays, 

30 phosphatase assays, nuclear hormone translocator assays, fluorescence activated cell screening 
(FACS) assays, colony-forming/plaque assays, and polymerase chain reaction assays. Such 
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assays are well-known to those of skill in the art and may be adapted to the methods of the 
present invention with no more than routine experimentation. 

All of the above screening methods may be accomplished using a variety of assay 
formats. In light of the present disclosure, those not expressly described herein will nevertheless 
5 be known and comprehended by those of ordinary skill in the art. The assays may identify drugs 
which are, e.g., either agonists or antagonists, of expression of a target gene of interest, or of a 
protein-protein or protein-substrate interaction of a target of interest, or of the role of target gene 
products in the pathogenesis of normal or abnormal cellular physiology, proliferation, and/or 
differentiation and disorders related thereto. Assay formats which approximate such conditions 

10 as formation of protein complexes or protein-nucleic acid complexes, enzymatic activity, and 
even specific signaling pathways, may be generated in many different forms, and include but are 
not limited to assays based on cell-free systems, e.g. purified proteins or cell lysates, as well as 
cell-based assays which utilize intact cells. 
4.2a. In vitro assay methods 

15 In many drug screening programs which test libraries of compounds and natural extracts, 

high throughput assays are desirable in order to maximize the number of compounds surveyed in 
a given period of time. Assays of the present invention which are performed in cell-free systems, 
such as may be derived with purified or semi-purified proteins or with lysates, are often preferred 
as "primary" screens in that they may be generated to permit rapid development and relatively 

20 easy detection of an alteration in a molecular target which is mediated by a test compound. 
Moreover, the effects of cellular toxicity and/or bioavailabiUty of the test compound may be 
generally ignored in the in vitro system, the assay instead being focused primarily on the effect 
of the compound on the molecular target as may be manifest in an alteration of binding affinity 
with other proteins or changes in enzymatic properties of the molecular target. Accordingly, 

25 potential modifiers, e.g., activators or inhibitors of protein-substrate, protein-protein interactions 
or nucleic acid-protein interactions of interest may be detected in a cell-free assay generated by 
constitution of fimction interactions of interest in a cell lysate. Such protein-substrate, protein- 
protein, or nucleic acid-protein interactions of interest may be identified by the gene expression 
profiling methods described herein. In an altemate format, the assay may be derived as a 

30 reconstituted protein mixture which, as described below, offers a number of benefits over lysate- 
based assays. 
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In one aspect, the present invention provides assays that may be used to screen for 
compounds which modulate protein-protein interactions, nucleic acid-protein interactions, or 
protein-substrate interactions of interest. For instance, the drug screening assays of the present 
invention may be designed to detect agents which disrupt binding of protein-protein interaction 
5 binding moieties. In other embodiments, the subject assays will identify inhibitors of the 
enzymatic activity of a protein or protein-protein interaction complex. In a preferred 
embodiment, the compound is a mechanism based inhibitor which chemically alters one member 
of a protein-protein interaction or one chemical group of a protein and which is a specific 
inhibitor of that member, e.g. has an inhibition constant 10-fold, 100-fold, or more preferably, 

10 1000-fold different compared to homologous proteins. 

In one embodiment of the present invention, screening assays may be generated which 
detect inhibitory compounds on the basis of their ability to interfere with binding of components 
of a given protein-substrate, protein-protein, or nucleic acid-protein interaction of interest. In an 
exemplary binding assay, the compound of interest is contacted with a mixture generated from 

15 protein-protein interaction component polypeptides. Detection and quantification of expected 
activity from a given protein-protein interaction provides a means for evaluating the compound's 
efficacy at inhibiting (or potentiating) complex formation between the two polypeptides. The 
efficacy of the compound may be assessed by generating dose response curves from data 
obtained using various concentrations of the test compound. Moreover, a control assay may also 

20 be performed to provide a baseline for comparison. In the control assay, the formation of 
complexes is quantitated in the absence of the test compound. 

Complex formation between component polypeptides, polypeptides and genes, or 
between a component polypeptide and a substrate may be detected by a variety of techniques, 
many of which are effectively described above. For instance, modulation in the formation of 

25 complexes may be quantitated using, for example, detectably labeled proteins (e.g. radiolabeled, 
fluorescently labeled, or enzymatically labeled), by immunoassay, or by chromatographic 
detection. 

Accordingly, one exemplary screening assay of the present invention includes the steps 
of contacting a polypeptide or fimctional fragment thereof or a binding partner with a test 
30 compound or library of test compounds and detecting the formation of complexes. For detection 
purposes, the molecule may be labeled with a specific marker and the test compound or library 
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of test compounds labeled with a different marker. Interaction of a test compound with a 
polypeptide or fragment thereof or binding partner may then be detected by evaluating the level 
of the two labels after an incubation step and a washing step. The presence of two labels after 
the washing step is indicative of an interaction. 

5 An interaction between molecules may also be identified by using real-time BIA 

(Biomolecular Interaction Analysis, Pharmacia Biosensor AB) which detects surface plasmon 
resonance (SPR), an optical phenomenon. Detection depends on changes in the mass 
concentration of macromolecules at the biospecific interface, and does not require any labeling 
of interactants. In one embodiment, a library of test compounds may be immobilized on a sensor 

10 surface, e.g., which forms one wall of a micro-flow cell. A solution containing the polypeptide, 
ftmctional fragment thereof, polypeptide analog or binding partner is then flown continuously 
over the sensor surface. A change in the resonance angle as shown on a signal recording, 
indicates that an interaction has occurred. This technique is fiuther described, e.g., in 
BIAtechnology Handbook by Pharmacia. 

1 5 Another exemplary screening assay of the present invention includes the steps of (a) 

forming a reaction mixture including: (i) a polypeptide, (ii) a binding partner, and (iii) a test 
compound; and (b) detecting interaction of the polypeptide and the binding partner. The 
polypeptide and binding partner may be produced recombinantly, purified from a source, e.g., 
plasma, or chemically synthesized, as described herein. A statistically significant change 

20 (potentiation or inhibition) in the interaction of the polypeptide and binding partner in the 

presence of the test compound, relative to the interaction in the absence of the test compoimd, 
indicates a potential agonist (mimetic or potentiator) or antagonist (inhibitor) of polypeptide 
bioactivity for the test compound. The compounds of this assay may be contacted 
simultaneously. Altematively, a polypeptide may first be contacted with a test compound for an 

25 appropriate amount of time, following which the binding partner is added to the reaction mixture. 
The efficacy of the compound may be assessed by generating dose response curves from data 
obtained using various concentrations of the test compound. Moreover, a control assay may also 
be performed to provide a baseline for comparison. In the control assay, isolated and purified 
polypeptide or binding partner is added to a composition containing the binding partner or 

30 polypeptide, and the formation of a complex is quantitated in the absence of the test compound. 
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Complex formation between a polypeptide and a binding partner may be detected by a 
variety of techniques. Modulation of the formation of complexes may be quantitated using, for 
example, detectably labeled proteins such as radiolabeled, fluorescently labeled, or 
enzymatically labeled polypeptides or binding partners, by immimoassay, or by chromatographic 
5 detection. 

Typically, it will be desirable to immobilize either the polypeptide or its binding partner 
to facilitate separation of complexes from uncomplexed forms of one or both of the proteins, as 
well as to accommodate automation of the assay. Binding of polypeptide to a binding partner 
may be accompUshed in any vessel suitable for containing the reactants. Examples include 

10 microtitre plates, test tubes, and micro-centrifiige tubes. In one embodiment, a fusion protein 
may be provided which adds a domain that allows the protein to be bound to a matrix. For 
example, glutathione-S-transferase/polypeptide (GST/polypeptide) fusion proteins may be 
adsorbed onto glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or glutathione 
derivatized microtitre plates, which are then combined with the binding partner, e.g. an 35s- 

15 labeled binding partner, and the test compound, and the mixture incubated under conditions 

conducive to complex formation, e.g. at physiological conditions for salt and pH, though slightly 
more stringent conditions may be desired. Following incubation, the beads are washed to 
remove any unbound label, and the matrix immobilized and radiolabel determined directly (e.g. 
beads placed in scintillant), or in the supernatant after the complexes are subsequently 

20 dissociated. Altematively, the complexes may be dissociated from the matrix, separated by 
SDS-PAGE, and the level of polypeptide or binding partner found in the bead fraction 
quantitated from the gel using standard electrophoretic techniques such as described in the 
appended examples. 

Other techniques for immobilizing proteins on matrices are also available for use in the 
25 subject assay. For instance, either the polypeptide or its cognate binding partner may be 
immobilized utilizing conjugation of biotin and streptavidin. For instance, biotinylated 
polypeptide molecules may be prepared from biotin-NHS (N-hydroxy-succinimide) using 
techniques well known in the art (e.g., biotinylation kit. Pierce Chemicals, Rockford, IL), and 
immobilized in the wells of streptavidin-coated 96 well plates (Pierce Chemical). Altematively, 
30 antibodies reactive with the polypeptide may be derivatized to the wells of the plate, and 

polypeptide trapped in the wells by antibody conjugation. As above, preparations of a binding 
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partner and a test compound are incubated in the polypeptide presenting wells of the plate, and 
the amount of complex trapped in the well may be quantitated. Exemplary methods for detecting 
such complexes, in addition to those described above for the GST-immobiUzed complexes, 
include immunodetection of complexes using antibodies reactive with the binding partner, or 
5 which are reactive with polypeptide and compete with the binding partner; as well as enzyme- 
linked assays which rely on detecting an enzymatic activity associated with the binding partner, 
either intrinsic or extrinsic activity. In the instance of the latter, the enzyme may be chemically 
conjugated or provided as a fusion protein with the binding partner. To illustrate, the binding 
partner may be chemically cross-linked or genetically fused with horseradish peroxidase, and the 

10 amount of polypeptide trapped in the complex may be assessed with a chromogenic substrate of ^ 
the enzyme, e.g. 3,3 -diamino-benzadine terahydrochloride or 4-chloro-l-napthol. Likewise, a 
fusion protein comprising the polypeptide and glutathione-S-transferase may be provided, and 
complex formation quantitated by detecting the GST activity using l-chloro-2,4-dinitrobenzene 
(Habig et al (1974) J Biol Chem 249:7130). 

1 5 For processes that rely on immunodetection for quantitating one of the proteins trapped in 

the complex, antibodies against the protein, such as anti-polypeptide antibodies, may be used. 
Altematively, the protein to be detected in the complex may be "epitope-tagged" in the form of a 
fusion protein which includes, in addition to the polypeptide sequence, a second polypeptide for 
which antibodies are readily available (e.g. from commercial sources). For instance, the GST 

20 fusion proteins described above may also be used for quantification of binding using antibodies 
against the GST moiety. Other useful epitope tags include myc-epitopes (e.g., see ElUson et al. 
(1991) J Biol Chem 266:21 150-21 157) which includes a 10-residue sequence from c-myc, as 
well as the pFLAG system (Intemational Biotechnologies, Inc.) or the pEZZ-protein A system 
(Pharmacia, NJ). 

25 In certain in vitro embodiments of the present assay, the protein or the set of proteins 

engaged in a protein-protein, protein-substrate, or protein-nucleic acid interaction comprises a 
reconstituted protein mixture of at least semi-purified proteins. By semi-purified, it is meant that 
the proteins utiUzed in the reconstituted mixture have been previously separated from other 
cellular or viral proteins. For instance, in contrast to cell lysates, the proteins involved in a 

30 protein-substrate, protein-protein or nucleic acid-protein interaction are present in the mixture to 
at least 50% purity relative to all other proteins in the mixture, and more preferably are present at 
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90-95% purity. In certain embodiments of the subject method, the reconstituted protein mixture 
is derived by mixing highly purified proteins such that the reconstituted mixture substantially 
lacks other proteins (such as of cellular or viral origin) which might interfere with or otherwise 
alter the ability to measure activity resulting from the given protein-substrate, protein-protein 

5 interaction, or nucleic acid-protein interaction. 

In one embodiment, the use of reconstituted protein mixtures allows more careful control 
of the protein-substrate, protein-protein, or nucleic acid-protein interaction conditions. 
Moreover, the system may be derived to favor discovery of inhibitors of particular intermediate 
states of the protein-protein interaction. For instance, a reconstituted protein assay may be 

10 carried out both in the presence and absence of a candidate agent, thereby allowing detection of 
an inhibitor of a given protein-substrate, protein-protein, or nucleic acid-protein interaction. 

Assaying biological activity resulting from a given protein-substrate, protein-protein or 
nucleic acid-protein interaction, in the presence and absence of a candidate inhibitor, may be 
accomplished in any vessel suitable for containing the reactants. Examples include microtitre 

15 plates, test tubes, and micro-centrifiige tubes. 

Typically, it will be desirable to immobilize one of the polypeptides to facilitate 
separation of complexes from uncomplexed forms of one of the proteins, as well as to 
acconmiodate automation of the assay. In an illustrative embodiment, a frision protein may be 
provided which adds a domain that permits the protein to be bound to an insoluble matrix. For 

20 example, protein-protein interaction component fusion proteins may be adsorbed onto 
glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or glutathione derivatized 
microtitre plates, which are then combined with a potential interacting protein, e.g. an ^^S- 
labeled polypeptide, and the test compound and incubated under conditions conducive to 
complex foraiation e.g., at 4°C m a buffer of 2mM Tris-HCl (pH 8), InM EDTA, 0.5% Nonidet 

25 P-40, and lOOmM NaCl . Following incubation, the beads are washed to remove any unbound 
interacting protein, and the matrix bead-bound radiolabel determined directly (e.g. beads placed 
in scintillant), or in the supernatant after the complexes are dissociated, e.g. when microtitre plate 
is used. Altematively, after washing away unbound protein, the complexes may be dissociated 
from the matrix, separated by SDS-PAGE gel, and the level of interacting polypeptide foimd in 

30 the matrix-bound fraction quantitated from the gel using standard electrophoretic techniques. 
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In yet another embodiment, the protein-protein interaction component or potential 
interacting polypeptide may be used to generate an two-hybrid or interaction trap assay (see also, 
U.S. Patent No: 5,283,317; Zervos et al (1993) Cell 72:223-232; Madura et al (1993) J Biol 
Chem 268:12046-12054; Bartel etal (1993) Biotechniques 14:920-924; and Iwabuchi etal 
5 (1993) Oncogene 8: 1693-1696), for subsequently detecting agents which disrupt binding of the 
interacting components to one another. 

In particular, the method makes use of chimeric genes which express hybrid proteins. To 
illustrate, a first hybrid gene comprising the coding sequence for a DNA-binding domain of a 
transcriptional activator may be fused in firame to the coding sequence for a "bait" protein, e.g., a 

10 protein-protein interaction component polypeptide of sufficient length to bind to a potential 

interacting protein. The second hybrid protein encodes a transcriptional activation domain fused 
in fi-ame to a gene encoding a "fish" protein, e.g., a potential interacting protein of sufficient 
length to interact with the protein-protein interaction component polypeptide portion of the bait 
fusion protein. If the bait and fish proteins are able to interact, e.g., form a protein-protein 

1 5 interaction component complex, they bring into close proximity the two domains of the 
transcriptional activator. This proximity causes transcription of a reporter gene which is 
operably linked to a transcriptional regulatory site responsive to the transcriptional activator, and 
expression of the reporter gene may be detected and used to score for the interaction of the bait 
and fish proteins. 

20 In accordance with the present invention, the method includes providing a host cell. The 

host cell contains a reporter gene having a binding site for the DNA-binding domain of a 
transcriptional activator used in the bait protein, such that the reporter gene expresses a 
detectable gene product when the gene is transcriptionally activated. The first chimeric gene may 
be present in a chromosome of the host cell, or as part of an expression vector. 

25 The host cell also contains a first chimeric gene which is capable of being expressed in 

the host cell. The gene encodes a chimeric protein, which comprises (i) a DNA-binding domain 
that recognizes the responsive element on the reporter gene in the host cell, and (ii) a bait 
protein, such as a protein-protein interaction component polypeptide sequence. 

A second chimeric gene is also provided which is capable of being expressed in the host 

30 cell, and encodes the "fish" fusion protein. In one embodiment, both the first and the second 
chimeric genes are introduced into the host cell in the form of plasmids. Preferably, however. 
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the first chimeric gene is present in a chromosome of the host cell and the second chimeric gene 
is introduced into the host cell as part of a plasmid. 

Preferably, the DNA-binding domain of the first hybrid protein and the transcriptional 
activation domain of the second hybrid protein are derived fi-om transcriptional activators having 
5 separable DNA-binding and transcriptional activation domains. For instance, these separate 
DNA-binding and transcriptional activation domains are known to be found in the yeast GAL4 
protein, and are known to be found in the yeast GCN4 and ADRl proteins. Many other proteins 
involved in transcription also have separable binding and transcriptional activation domains 
which make them usefiil for the present invention, and include, for example, the Lex A and VP 16 

1 0 proteins. It will be understood that other substantially transcriptionally-inert DNA-binding 

domains may be used in the subject constructs; such as domains of ACEl, Xcl, lac repressor, jxm 
or fos. In another embodiment, the DNA-binding domain and the transcriptional activation 
domain may be fi-om different proteins. The use of a LexA DNA binding domain provides 
certain advantages. For example, in yeast, the LexA moiety contains no activation fimction and 

15 has no known effect on transcription of yeast genes. In addition, use of LexA allows control 
over the sensitivity of the assay to the level of interaction (see, for example, the Brent et al PCT 
publication WO94/10300). 

In preferred embodiments, any enzymatic activity associated with the bait or fish proteins 
is inactivated, e.g., dominant negative or other mutants of a protein-protein interaction 

20 component may be used. 

Continuing with the illustrated example, the protein-protein interaction component- 
mediated interaction, if any, between the bait and fish fiision proteins in the host cell, therefore, 
causes the activation domain to activate transcription of the reporter gene. The method is carried 
out by introducing the first chimeric gene and the second chimeric gene into the host cell, and 

25 subjecting that cell to conditions under which the bait and fish fiision proteins are expressed in 
sufficient quantity for the reporter gene to be activated. The formation of a protein-protein 
interaction component/interacting protein complex results in a detectable signal produced by the 
expression of the reporter gene. Accordingly, the level of formation of a complex in the 
presence of a test compound and in the absence of the test compound may be evaluated by 

30 detecting the level of expression of the reporter gene in each case. Various reporter constructs 
may be used in accord with the methods of the invention and include, for example, reporter 
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genes which produce such detectable signals as selected from the group consisting of an 
enzymatic signal, a fluorescent signal, a phosphorescent signal and drug resistance. 

One aspect of the present invention provides reconstituted protein preparations, e.g., 
combinations of proteins participating in protein-protein interactions. 
5 4,2b. In vivo assay methods 

Such methods are referred to within this section as in vivo as they involve the use of 
whole cells in culture or the use of animals or samples taken therefrom. In an illustrative 
embodiment, cells derived from a subject having a disorder of CD8+ T cell priming or a disease 
in which CD8+ T cell priming is a component, and their progeny, can be used to screen various 
10 compoxmds. Such cells can be maintained in minimal culture media for extended periods of time 
(e.g., for 7-21 days or longer) and can be contacted with any compound, to determine the effect 
of such compound on a property of the cell, e.g. gene expression, protein-protein interactions, 
protein activity, cellular growth, proliferation or differentiation of progenitor cells in the culture, 
and the like. 

1 5 Detection and quantification of growth, proliferation or differentiation of these cells in 

response to a given compound provides a means for evaluating the compound's efficacy at 
inducing any of the properties of growth, proliferation or differentiation. For example, certain 
disorders of CD8+ T cell priming involve the inability or reduced ability of a naive CD8+ T cell 
to differentiate into a primed CD8+ T cell. Methods of measuring cell proliferation are well 

20 known in the art and most commonly include evaluating DNA synthesis or protein synthesis 
characteristic of cell replication. There are numerous methods in the art for measuring DNA or 
protein synthesis, any of which may be used according to the invention. In an embodiment of 
the invention, DNA synthesis may be determined using a radioactive label (e.g., ^H-thymidine) 
or labeled nucleotide analogues (5-bromo-2-deoxyuridine, or BrdU) for detection by 

25 immunofluorescence. In another embodiment of the invention, protein synthesis may be 

determined using a radioactive labeled amino acid (e.g., ^H-leucine) or labeled amino acid or 
amino acid analogues for detection by immunofluorescence. The efficacy of the compoimd can 
be assessed by generating dose response curves from data obtained using various concentrations 
of the compoimd. A control assay can also be performed to provide a baseline for comparison. 

30 Identification of the progenitor cell population(s) ampUfied in response to a given test compoimd 
can be carried out according to such phenotyping as described above. 
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In still further embodiments, a protein-protein, protein-substrate, or protein-nucleic acid 
interaction of interest is generated in whole cells, taking advantage of cell culture techniques to 
support the subject assay. For example, as described below, the interaction of interest may be 
constituted in a eukaryotic cell culture system, including mammalian and yeast cells. 
5 Advantages to generating the subject assay in an intact cell include the ability to detect inhibitors 
which are functional in an environment more closely approximating that which therapeutic use 
of the inhibitor would require, including the ability of the compoimd to gain entry into the cell. 
Furthermore, certain of the in vivo embodiments of the assay, such as examples given below, are 
amenable to high through-put analysis of candidate agents. 

10 The components of the interaction of interest may be endogenous to the cell selected to 

support the assay, Altematively, some or all of the components may be derived from exogenous 
sources. For instance, fusion proteins may be introduced into the cell by recombinant techniques 
(such as through the use of an expression vector), as well as by microinjecting the fusion protein 
itself or mRNA encoding the fusion protein. 

15 In any case, the cell may be ultimately manipulated after incubation with a candidate 

inhibitor in order to facilitate detection of a protein-protein, protein-substrate, or protein-nucleic 
acid interaction-mediated signaling event (e.g. modulation of a post-translational modification of 
a protein-protein interaction component substrate, such as phosphorylation, modulation of 
transcription of a gene in response to cell signaling, etc.). As described above for assays 

20 performed in reconstituted protein mixtures or lysate, the effectiveness of a candidate inhibitor 
may be assessed by measuring direct characteristics of an interaction component, such as shifts 
in molecular weight by electrophoretic means or detection in a binding assay. For these 
embodiments, the cell will typically be lysed at the end of incubation with the candidate agent, 
and the lysate manipulated in a detection step in much the same manner as might be the 

25 reconstituted protein mixture or lysate, e.g., described above. 

Indirect measurement of an interaction may also be accompUshed, for example, by 
detecting a biological activity associated with a protein-protein interaction component that is 
modulated by a protein-protein interaction mediated signaling event. As set out above, the use of 
fusion proteins comprising a protein-protein interaction component polypeptide and an 

30 enzymatic activity are representative embodiments of the subject assay in which the detection 
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means relies on indirect measurement of a protein-protein interaction component polypeptide by 
quantitating an associated enzymatic activity. 

In other embodiments, the biological activity of a nucleic acid-protein, protein-substrate 
or protein-protein interaction component polypeptide may be assessed by monitoring changes in 
5 the phenotype of the targeted cell For example, changes in the phenotype of a cell may be 
assess by gene expression profiling, described in Section 6. In another example, the detection 
means may include a reporter gene construct which includes a transcriptional regulatory element 
that is dependent in some form on the level of an interaction component or a interaction 
component substrate. The protein interaction component may be provided as a fusion protein 

1 0 with a domain which binds to a DNA element of the reporter gene construct. The added domain 
of the fusion protein may be one which, through its DNA-binding ability, increases or decreases 
transcription of the reporter gene. Whichever the case may be, its presence in the fusion protein 
renders it responsive to the protein-protein interaction-mediated signaling pathway. 
Accordingly, the level of expression of the reporter gene will vary with the level of expression of 

IS the protein interaction component. 

The reporter gene product is a detectable label, such as luciferase, p-lactamase or p- 
galactosidase, and is produced in the intact cell. The label may be measured in a subsequent 
lysate of the cell. However, the lysis step is preferably avoided, and providing a step of lysing 
the cell to measure the label will typically only be employed where detection of the label caimot 

20 be accompHshed in whole cells. 

Moreover, in the whole cell embodiments of the subject assay, the reporter gene construct 
may provide, upon expression, a selectable marker. A reporter gene includes any gene that 
expresses a detectable gene product, which may be RNA or protein. Preferred reporter genes are 
those that are readily detectable. The reporter gene may also be included in the construct in the 

25 form of a fusion gene with a gene that includes desired transcriptional regulatory sequences or 
exhibits other desirable properties. For instance, the product of the reporter gene may be an 
enzyme which confers resistance to antibiotic or other drug, or an enzyme which complements a 
deficiency in the host cell (i.e. thymidine kinase or dihydrofolate reductase). To illustrate, the 
aminoglycoside phosphotransferase encoded by the bacterial transposon gene Tn5 neo may be 

30 placed under transcriptional control of a promoter element responsive to the level of a protein- 
protein interaction component polypeptide present in the cell. Such embodiments of the subject 
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assay are particularly amenable to high through-put analysis in that proliferation of the ceil may 
provide a simple measure of inhibition of an interaction. 

Other examples of reporter genes include, but are not limited to CAT (chloramphenicol 
acetyl transferase) (Alton and Vapnek (1979), Nature 282: 864-869) luciferase, and other 

5 enzyme detection systems, such as p-galactosidase, P-lactamase, (G. Zlokamik, et al. (1998) 
Science, 279:84-88), firefly luciferase (deWet et al. (1987), Mol Cell Biol 7:725-737), bacterial 
luciferase (Engebrecht and Silverman (1984), PNAS 1: 4154-4158; Baldwin et al. (1984), 
Biochemistry 23: 3663-3667), alkaline phosphatase (Toh et al. (1989) Eur, J. Biochem. 182: 231- 
238, Hall et al. (1983) /. Mol Appl Gen. 2: 101), human placental secreted alkaline phosphatase 

10 (Cullen and Malim (1992) Methods in Enzymol 216:362-368). 

The amount of transcription from the reporter gene may be measured using any method 
known to those of skill in the art to be suitable. For example, specific mRNA expression may be 
detected using Northem blots or specific protein product may be identified by a characteristic 
stain, western blots or an intrinsic activity. 

15 In preferred embodiments, the product of the reporter gene is detected by an intrinsic 

activity associated with that product. For instance, the reporter gene may encode a gene product 
that, by enzymatic activity, gives rise to a detection signal based on color, fluorescence, or 
limiinescence. 

The amount of expression from the reporter gene is then compared to the amount of 
20 expression in either die same cell in the absence of the test compound or it may be compared 
with the amount of transcription in a substantially identical cell that lacks a component of the 
interaction of interest. 

4. 3. Therapeutic Agent Efficacy Screening 

The efficacy of candidate therapeutics identified using the methods of the invention may 
25 be evaluated by, for example, a) contacting CD8+ cells of a subject with a candidate therapeutic 
and b) evaluating its ability to normalize the level of CD8+ T cell priming in the subject or 
normalize the gene expression in the subject's CD8+ cells using assays directed to evaluating the 
same. If a candidate therapeutic is shown by assay to induce a high level of CD8+ T cell 
priming, then the candidate may be considered a CD8+ T cell priming enhancing dmg. 
30 Conversely, if a candidate therapeutic is shown by assay to inhibit the level of CD8+ T cell 
priming, then the candidate may be considered a CD8+ T cell priming inhibiting drug. 
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Alternatively, the efficacy of candidate therapeutics may be evaluated by comparing the 
expression levels of one or more genes differentially expressed during CD8+ T cell priming in a 
CD8+ T cell of a subject having a disorder of CD8+ T cell priming, or a disease in which CD8+ 
T cell priming is a component, with that of a normal CD8+ T cell. In one embodiment, the 
5 expression level of the genes may be determined using microarrays or other methods of RNA 
quantitation, or by comparing the gene expression profile of an CD8+ T cell treated with a 
candidate therapeutic with the gene expression profile of a normal CD8+ T cell. 

The efficacy of the compounds may then be tested in additional in vitro assays and in 
vivOy and in tumor xenograft studies. A test compound may be administered to a test animal and 

10 inhibition of tumor growth monitored. Expression of one or more genes differentially expressed 
during CD8+ T cell priming may also be measured before and after administration of the test 
compound to the animal. A normalization of the expression of one or more of these genes is 
indicative of the efficiency of the compound for treating disorders of CD8+ T cell priming, or 
diseases in which CD8+ T cell priming is a component in the animal. 

15 In another embodiment of the invention, a drag is developed by rational drag design, i.e., 

it is designed or identified based on information stored in computer readable form and analyzed 
by algorithms. More and more databases of expression profiles are currently being established, 
numerous ones being publicly available. By screening such databases for the description of 
drags affecting the expression of at least some of the genes differentially expressed during CD8+ 

20 T cell priming in a manner similar to the change in gene expression profile fi'om a CD8+ T cell 
fi-om an individual with a disorder of CD8+ T cell priming, or a disease in which CD8+ T cell 
priming is a component, to that of a normal individual's CD8+ T cell corresponding to the CD8+ 
T cell firom the individual having said disorder or disease, compounds may be identified which 
normaUze gene expression. Derivatives and analogues of such compounds may then be 

25 synthesized to optimize the activity of the compound, and tested and optimized as described 
above. 

Compounds identified by the methods described above are within the scope of the 
invention. Compositions comprising such compounds, in particular, compositions comprising a 
pharmaceutically efficient amount of the drag in a pharmaceutically acceptable carrier are also 
30 provided. Certain compositions comprise one or more active compound for treating disorders of 
CD8+ T cell priming, or diseases in which CD8+ T cell priming is a component 
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4.4. Pharmaceutical Compositions of Therapeutic Agents 

The present invention further provides methods of treating disorders of CD8+ T cell 
priming or diseases in which CD8+ T cell priming is a component using pharmaceutical 
compositions comprised of therapeutic agents identified using the screening methods provided 

5 by the invention. The present invention includes the use of pharmaceutical compositions 

comprised of the therapeutics in a method to treat disorders of CD8+ T cell priming or diseases 
in which CD8+ T cell priming is a component. Such methods may include administering to a 
subject having an disorder a pharmaceutically effective amount of an agonist or antagonist of one 
or more genes or their encoded gene products involved in CD8+ T cell priming. 

10 The compounds of the present invention may be administered by various means, 

depending on their intended use, as is well known in the art. For example, if compounds of the 
present invention are to be administered orally, they may be formulated as tablets, capsules, 
granules, powders or syrups. Alternatively, formulations of the present invention may be 
administered parenterally as injections (intravenous, intramuscular or subcutaneous), drop 

1 5 infusion preparations or suppositories. For application by the ophthahnic mucous membrane 
route, compounds of the present invention may be formulated as eyedrops or eye ointments. 
These formulations may be prepared by conventional means, and, if desired, the compounds may 
be mixed with any conventional additive, such as an excipient, a binder, a disintegrating agent, a 
lubricant, a corrigent, a solubilizing agent, a suspension aid, an emulsifying agent or a coating 

20 agent. 

Li formulations of the subject invention, wetting agents, emulsifiers and lubricants, such 
as sodium lauryl sulfate and magnesium stearate, as well as coloring agents, release agents, 
coating agents, sweetening, flavoring and perfuming agents, preservatives and antioxidants may 
be present in the formulated agents. 

25 Subject compounds may be suitable for oral, nasal, topical (including buccal and 

sublingual), rectal, vaginal, aerosol and/or parenteral administration. The formulations may 
conveniently be presented in unit dosage form and may be prepared by any methods well known 
in the art of pharmacy. The amount of agent that may be combined with a carrier material to 
produce a single dose vary depending upon the subject being treated, and the particular mode of 

30 administration. 
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Methods of preparing these formulations include the step of bringing into association 
agents of the present invention with the carrier and, optionally, one or more accessory 
ingredients. In general, the formulations are prepared by uniformly and intimately bringing into 
association agents with liquid carriers, or finely divided solid carriers, or both, and then, if 
5 necessary, shaping the product. 

Formulations suitable for oral administration may be in the form of capsules, cachets, 
pills, tablets, lozenges (using a flavored basis, usually sucrose and acacia or tragacanth), 
powders, granules, or as a solution or a suspension in an aqueous or non-aqueous liquid, or as an 
oil-in-water or water-in-oil liquid emulsion, or as an elixir or syrup, or as pastilles (using an inert 

10 base, such as gelatin and glycerin, or sucrose and acacia), each containing a predetermined 

amount of a compound thereof as an active ingredient. Compounds of the present invention may 
also be administered as a bolus, electuary, or paste. 

In solid dosage forms for oral administration (capsules, tablets, pills, dragees, powders, 
granules and the like), the coordination complex thereof is mixed with one or more 

15 pharmaceutically acceptable carriers, such as sodium citrate or dicalcium phosphate, and/or any 
of the following: (1) fillers or extenders, such as starches, lactose, sucrose, glucose, maimitol, 
and/or silicic acid; (2) binders, such as, for example, carboxymethylcellulose, alginates, gelatin, 
polyvinyl pyrroUdone, sucrose and/or acacia; (3) humectants, such as glycerol; (4) disintegrating 
agents, such as agar-agar, calcium carbonate, potato or tapioca starch, alginic acid, certain 

20 siUcates, and sodium carbonate; (5) solution retarding agents, such as paraffin; (6) absorption 
accelerators, such as quatemary anmionium compounds; (7) wetting agents, such as, for 
example, acetyl alcohol and glycerol monostearate; (8) absorbents, such as kaolin and bentonite 
clay; (9) lubricants, such a talc, calcium stearate, magnesium stearate, solid polyethylene glycols, 
sodium lauryl sulfate, and mixtures thereof; and (10) coloring agents. In the case of capsules, 

25 tablets and pills, the compositions may also comprise buffering agents. Solid compositions of a 
similar type may also be employed as fillers in soft and hard-filled gelatin capsules using such 
excipients as lactose or milk sugars, as well as high molecular weight polyethylene glycols and 
the like. 

A tablet may be made by compression or molding, optionally with one or more accessory 
30 ingredients. Compressed tablets may be prepared using binder (for example, gelatin or 

hydroxypropyhnethyl cellulose), lubricant, inert diluent, preservative, disintegrant (for example. 



-42- 



sodium starch glycolate or cross-linked sodium carboxymethyl cellulose), surface-active or 
dispersing agent. Molded tablets may be made by molding in a suitable machine a mixture of the 
supplement or components thereof moistened with an inert liquid diluent. Tablets, and other 
soHd dosage fomis, such as dragees, capsules, pills and granules, may optionally be scored or 

5 prepared with coatings and shells, such as enteric coatings and other coatings well known in the 
pharmaceutical-formulating art. 

Liquid dosage forms for oral administration include pharmaceutically acceptable 
emulsions, microemulsions, solutions, suspensions, syrups and elixirs. In addition to the 
compound, the liquid dosage forms may contain inert diluents commonly used in the art, such as, 

10 for example, water or other solvents, solubiUzing agents and emulsifiers, such as ethyl alcohol, 
isopropyl alcohol, ethyl carbonate, ethyl acetate, benzyl alcohol, benzyl benzoate, propylene 
glycol, 1,3-butylene glycol, oils (in particular, cottonseed, groundnut, com, germ, olive, castor 
and sesame oils), glycerol, tetrahydrofuryl alcohol, polyethylene glycols and fatty acid esters of 
sorbitan, and mixtures thereof 

15 Suspensions, in addition to compounds, may contain suspending agents as, for example, 

ethoxylated isostearyl alcohols, polyoxyethylene sorbitol and sorbitan esters, microcrystalline 
cellulose, aluminum metahydroxide, bentonite, agar-agar and tragacanth, and mixtures thereof. 

Formulations for rectal or vaginal administration may be presented as a suppository, 
which may be prepared by mixing a coordination complex of the present invention with one or 

20 more suitable non-irritating excipients or carriers comprising, for example, cocoa butter, 

polyethylene glycol, a suppository wax or a salicylate, and which is solid at room temperature, 
but liquid at body temperature and, therefore, will melt in the body cavity and release the active 
agent. Formulations which are suitable for vaginal administration also include pessaries, 
tampons, creams, gels, pastes, foams or spray formulations containing such carriers as are known 

25 in the art to be appropriate. 

Dosage forms for transdermal administration of a supplement or component includes 
powders, sprays, ointments, pastes, creams, lotions, gels, solutions, patches and inhalants. The 
active component may be mixed imder sterile conditions with a pharmaceutically acceptable 
carrier, and with any preservatives, buffers, or propellants which may be required. For 

30 transdermal administration of transition metal complexes, the complexes may include lipophilic 
and hydrophihc groups to achieve the desired water solubility and transport properties. 
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The ointments, pastes, creams and gels may contain, in addition to a supplement or 
components thereof, excipients, such as animal and vegetable fats, oils, waxes, paraffins, starch, 
tragacanth, cellulose derivatives, polyethylene glycols, siUcones, bentonites, silicic acid, talc and 
zinc oxide, or mixtures thereof 
5 Powders and sprays may contain, in addition to a supplement or components thereof, 

excipients such as lactose, talc, silicic acid, aluminum hydroxide, calcium silicates and 
polyamide powder, or mixtures of these substances. Sprays may additionally contain customary 
propellants, such as chlorofluorohydrocarbons and volatile unsubstituted hydrocarbons, such as 
butane and propane. 

10 Compoimds of the present invention may alternatively be administered by aerosol. This 

is accomplished by preparing an aqueous aerosol, liposomal preparation or solid particles 
containing the compound. A non-aqueous (e.g., fluorocarbon propellant) suspension could be 
used. Sonic nebulizers may be used because they minimize exposing the agent to shear, which 
may result in degradation of the compoimd. 

15 Ordinarily, an aqueous aerosol is made by formulating an aqueous solution or suspension 

of the compound together with conventional pharmaceutically acceptable carriers and stabilizers. 
The carriers and stabilizers vary with the requirements of the particular compound, but typically 
include non-ionic surfactants (Tween®, Pluronic®, or polyethylene glycol), innocuous proteins 
like serum albumin, sorbitan esters, oleic acid, lecithin, amino acids such as glycine, buffers, 

20 salts, sugars or sugar alcohols. Aerosols generally are prepared from isotonic solutions. 

Pharmaceutical compositions of this invention suitable for parenteral administration 
comprise one or more components of a supplement in combination with one or more 
pharmaceutically-acceptable sterile isotonic aqueous or non-aqueous solutions, dispersions, 
suspensions or emulsions, or sterile powders which may be reconstituted into sterile injectable 

25 solutions or dispersions just prior to use, which may contain antioxidants, buffers, bacteriostats, 
solutes which render the formulation isotonic with the blood of the intended recipient or 
suspending or thickening agents. 

Examples of suitable aqueous and non-aqueous carriers which may be employed in the 
pharmaceutical compositions of the invention include water, ethanol, polyols (such as glycerol, 

30 propylene glycol, polyethylene glycol, and the like), and suitable mixtures thereof, vegetable 
oils, such as oUve oil, and injectable organic esters, such as ethyl oleate. Proper fluidity may be 
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maintained, for example, by the use of coating materials, such as lecithin, by the maintenance of 
the required particle size in the case of dispersions, and by the use of surfactants. 
4.5. Methods of Treatment Using Pharmaceutical Compositions 
The dosage of any pharmaceutical composition of the present invention will vary 
5 depending on the symptoms, age and body weight of the patient, the nature and severity of the 
disorder to be treated or prevented, the route of administration, and the form of the supplement. 
Any of the subject formulations may be administered in a single dose or in divided doses. 
Dosages for the compounds of the present invention may be readily determined by techniques 
known to those of skill in the art or as taught herein. Also, the present invention provides 

10 mixtures of more than one subject compound, as well as other therapeutic agents. 

The precise time of administration and amount of any particular compound that will yield 
the most effective treatment in a given patient will depend upon the activity, pharmacokinetics, 
and bioavailability of a particular compound, physiological condition of the patient (including 
age, sex, disease type and stage, general physical condition, responsiveness to a given dosage 

15 and type of medication), route of administration, and the like. The guidelines presented herein 
may be used to optimize the treatment, e.g., determining the optimum time and/or amount of 
administration, which will require no more than routine experimentation consisting of 
monitoring the subject and adjusting the dosage and/or timing. 

While the subject is being treated, the health of the patient may be monitored by 

20 measuring one or more of the relevant indices at predetermined times during a 24-hour period. 
Treatment, including supplement, amounts, times of administration and formulation, maybe 
optimized according to the results of such monitoring. The patient may be periodically 
reevaluated to determine the extent of improvement by measuring the same parameters, the first 
such reevaluation typically occurring at the end of four weeks from the onset of therapy, and 

25 subsequent reevaluations occurring every four to eight weeks during therapy and then every 
three months thereafter. Therapy may continue for several months or even years, with a 
minimum of one month being a typical length of therapy for humans. Adjustments to the 
amoimt(s) of agent administered and possibly to the time of administration may be made based 
on these reevaluations. 
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Treatment may be initiated with smaller dosages which are less than the optimum dose of 
the compound. Thereafter, the dosage may be increased by small increments until the optimum 
therapeutic effect is attained. 

The combined use of several compounds of the present invention, or alternatively other 
5 chemotherapeutic agents, may reduce the required dosage for any individual component because 
the onset and duration of effect of the different components may be complimentary. In such 
combined therapy, the different active agents may be delivered together or separately, and 
simultaneously or at different times within the day. 

Toxicity and therapeutic efficacy of subject compounds may be determined by standard 
10 pharmaceutical procedures in cell cultures or experimental animals, e.g., for determining the 

LD50 and the ED50. Compositions that exhibit large therapeutic indices are preferred. Although 
compounds that exhibit toxic side effects may be used, care should be taken to design a delivery 
system that targets the compounds to the desired site in order to reduce side effects. 

The data obtained firom the cell culture assays and animal studies may be used in 
15 formulating a range of dosage for use in humans. The dosage of any supplement, or alternatively 
of any components therein, lies preferably within a range of circulating concentrations that 
include the ED50 with little or no toxicity. The dosage may vary within this range depending 
upon the dosage form employed and the route of administration utilized. For agents of the 
present invention, the therapeutically effective dose may be estimated initially fi-om cell culture 
20 assays. A dose may be formulated in animal models to achieve a circulating plasma 

concentration range that includes the IC50 (i.e., the concentration of the test compoimd which 
achieves a half-maximal inhibition of symptoms) as determined in cell culture. Such information 
may be used to more accurately determine useful doses in hxraians. Levels in plasma may be 
measured, for example, by high performance liquid chromatography. 

25 
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5. Compositions Comprising Probes Derived from Targets of the Invention 

The present invention provides compositions comprised of probes derived from the 
sequences of the genes or proteins encoded by them comprising the panels of the present 
invention. These compositions are contemplated for use in diagnostic applications as discussed 
5 herein. The invention also provides compositions comprising one or more detection agents for 
detecting the expression of genes whose expression is characteristic of a disorder of CD8+ T cell 
priming or disease in which CD8+ T cell priming is a component, e.g., for use in diagnostic 
assays. These agents, which may be, e.g., nucleic acids or polypeptides, maybe in solution or 
bound to a solid surface, such as in the form of a microarray. Other embodiments of the 

10 invention include databases, computer readable media, computers containing the gene expression 
profile[s] of the invention or the level of expression of one or more genes whose expression is 
characteristic of a disorder of CD8+ T cell priming or a disease in which CD8+ T cell priming is 
a component in a diseased CD8+ T cell. The composition may comprise probes corresponding to 
at least 10, preferably at least 20, at least 50, at least 100 or at least 1000 genes involved in 

15 neoplasia. The composition may comprise probes corresponding to each gene Usted in FIGURE 
1, or subsets of those genes in FIGURE 1 which are up-regulated or down-regulated during 
CD8+ T cell priming. 

In one embodiment of the present invention, the composition is a microarray. There may 
be one or more than one probe corresponding to each gene on a microarray. For example, a 

20 microarray may contain from 2 to 20 probes corresponding to one gene and preferably about 5 to 
10. The probes may correspond to the full length RNA sequence or complement thereof of 
genes that are differentially expressed during CD8+ T cell priming, or they may correspond to a 
portion thereof, which portion is of sufficient length for permitting specific hybridization. Such 
probes may comprise from about 50 nucleotides to about 100, 200, 500, or 1000 nucleotides or 

25 more than 1000 nucleotides. As further described herein, microarrays may contain 

oUgonucleotide probes, consisting of about 10 to 50 nucleotides, preferably about 15 to 30 
nucleotides and even more preferably 20-25 nucleotides. The probes are preferably single 
stranded. The probe will have sufficient complementarity to its target to provide for the desired 
level of sequence specific hybridization (see below). 

30 Typically, the arrays used in the present invention will have a site density of greater than 

100 different probes per cm^, although any suitable site density is included in the present 
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invention. Preferably, the arrays will have a site density of greater than 500/cm^, more 
preferably greater than about 1000/cm^, and most preferably, greater than about 10,000/cm^. 
Preferably, the arrays will have more than 100 different probes on a single substrate, more 
preferably greater than about 1000 different probes still more preferably, greater than about 
5 10,000 different probes and most preferably, greater than 100,000 different probes on a single 
substrate. 

Microarrays maybe prepared by methods known in the art, as described below, or they 
maybe custom made by companies, e.g., Affymetrix (Santa Clara, CA). 

Generally, two types of microarrays maybe used. These two types are referred to as 

10 "synthesis" and "delivery." In the synthesis type, a microarray is prepared in a step-wise fashion 
by the in situ synthesis of nucleic acids from nucleotides. With each round of synthesis, 
nucleotides are added to growing chains until the desired length is achieved. In the delivery type 
of microarray, pre-prepared nucleic acids are deposited onto known locations using a variety of 
deUvery technologies. Numerous articles describe the different microarray technologies, e.g., 

15 Shena et al. (1998) Tibtech 16: 301; Duggan et al. (1999) Nat. Genet. 21:10; Bowtell et al. 
(1999; Mi^. Genet. 21:25. 

One novel synthesis technology is that developed by Affymetrix (Santa Clara, CA), 
which combines photolithography technology with DNA synthetic chemistry to enable high 
density oligonucleotide microarray manufacture. Such chips contain up to 400,000 groups of 

20 oligonucleotides in an area of about 1,6 cm^. Oligonucleotides are anchored at the 3' end 

thereby maximizing the availability of single-stranded nucleic acid for hybridization. Generally 
such chips, referred to as "GeneChips®" contain several oUgonucleotides of a particular gene, 
e.g., between 15-20, such as 16 oUgonucleotides. Since Asymetrix (Santa Clara, CA) sells 
custom made microarrays, microarrays containing genes that are differentially expressed during 

25 CD8+ T cell priming may be ordered for purchase from Affymetrix (Santa Clara, CA). 

Microarrays may also be prepared by mechanical microspotting, e.g., those 
conunercialized at Synteni (Fremont, CA). According to these methods, small quantities of 
nucleic acids are printed onto sohd surfaces. Microspotted arrays prepared at Synteni contain as 
many as 10,000 groups of cDNA in an area of about 3.6 cm^. 

30 A third group of microarray technologies consist in the "drop-on-demand" deUvery 

approaches, the most advanced of which are the ink-jetting technologies, which utilize 
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piezoelectric and other forms of propulsion to transfer nucleic acids from miniature nozzles to 
solid surfaces. Inkjet technologies are developed at several centers including Incyte 
Pharaiaceuticals (Palo Alto, CA) and Protogene (Palo Alto, CA). This technology results in a 
density of 10,000 spots per cml See also, Hughes et al. (2001) Nat. Biotechn, 19:342. 
5 Arrays preferably include control and reference nucleic acids. Control nucleic acids are 

nucleic acids which serve to indicate that the hybridization was effective. For example, all 
Affymetrix (Santa Clara, CA) expression arrays contain sets of probes for several prokaryotic 
genes, e.g., bioB, bioC and bioD from biotin synthesis of E. coh and ere from PI bacteriophage. 
Hybridization to these arrays is conducted in the presence of a mixture of these genes or portions 

10 thereof, such as the mix provided by Affymetrix (Santa Clara, CA) to that effect (Part Number 
900299), to thereby confirm that the hybridization was effective. Control nucleic acids included 
with the target nucleic acids may also be mRNA synthesized from cDNA clones by in vitro 
transcription. Other control genes that may be included in arrays are polyA controls, such as 
dap, lys, phe, thr, and trp (which are included on Affymetrix GeneChips®) 

15 Reference nucleic acids allow the normalization of results from one experiment to 

another, and to compare multiple experiments on a quantitative level. Exemplary reference 
nucleic acids include housekeeping genes of known expression levels, e.g., GAPDH, hexokinase 
and actin. 

Mismatch controls may also be provided for the probes to the target genes, for expression 
20 level controls or for normalization controls. Mismatch controls are oligonucleotide probes or 
other nucleic acid probes identical to their corresponding test or control probes except for the 
presence of one or more mismatched bases. 

Arrays may also contain probes that hybridize to more than one allele of a gene. For 
example the array may contain one probe that recognizes allele 1 and another probe that 
25 recognizes allele 2 of a particular gene. 

Microarrays may be prepared as follows. In one embodiment, an array of 
oligonucleotides is synthesized on a solid support. Exemplary solid supports include glass, 
plastics, polymers, metals, metalloids, ceramics, organics, etc. Using chip masking technologies 
and photoprotective chemistry it is possible to generate ordered arrays of nucleic acid probes. 
30 These arrays, which are known, e.g., as "DNA chips," or as very large scale immobilized 
polymer arrays ("VLSIPS™" arrays) may include miUions of defined probe regions on a 
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substrate having an area of about 1 cm^ to several cm^, thereby incorporating sets of from a few 
to millions of probes {see, e.g., U.S. Patent No. 5,631,734). 

The construction of soUd phase nucleic acid arrays to detect target nucleic acids is well 
described in the literature. See, Fodor et al. (1991) Science, 251: 161-111\ Sheldon et al. (1993) 

5 Clinical Chemistry 39(4): 718-719; Kozal et al. (1996) Nature Medicine 2(7): 753-759 and 

Hubbell U.S. Pat. No. 5,571,639; Pinkel et al. PCT/US95/16155 (WO 96/17958); U.S. Pat. Nos. 
5,677,195; 5,624,711; 5,599,695; 5,451,683; 5,424,186; 5,412,087; 5,384,261; 5,252,743 and 
5,143,854; PCX Patent Publication Nos. 92/10092 and 93/09668; and PCX WO 97/10365. In 
brief, a combinatorial strategy allows for the synthesis of arrays containing a large number of 

10 probes using a minimal number of synthetic steps. For instance, it is possible to synthesize and 
attach all possible DNA 8 mer oligonucleotides (48, or 65,536 possible combinations) using only 
32 chemical synthetic steps. In general, VLSIPS™ procedures provide a method of producing 4n 
different oligonucleotide probes on an array using only 4n synthetic steps {see, e.g., U.S. Pat. No. 
5,631,734; 5,143,854 and PCX Patent Publication Nos. WO 90/15070; WO 95/1 1995 and WO 

15 92/10092). 

Light-directed combinatorial synthesis of oligonucleotide arrays on a glass surface may 
be performed with automated phosphoramidite chemistry and chip masking techniques similar to 
photoresist technologies in the computer chip industry. Xypically, a glass surface is derivatized 
with a silane reagent containing a functional group, e.g., a hydroxyl or amine group blocked by a 

20 photolabile protecting group. Photolysis through a photolithogaphic mask is used selectively to 
expose functional groups which are then ready to react with incoming 5*-photoprotected 
nucleoside phosphoramidites. Xhe phosphoramidites react only with those sites which are 
illuminated (and thus exposed by removal of the photolabile blocking group). Xhus, the 
phosphoramidites only add to those areas selectively exposed from the preceding step. Xhese 

25 steps are repeated until the desired array of sequences have been synthesized on the solid surface. 

Algorithms for design of masks to reduce the number of synthesis cycles are described by 
Hubbel et al., U.S. Pat. No. 5,571,639 and U.S. Pat. No. 5,593,839. A computer system may be 
used to select nucleic acid probes on the substrate and design the layout of the array as described 
in U.S. Pat. No. 5,571,639. 

30 Another method for synthesizing high density arrays is described in U.S. Patent No. 

6,083,697. Xhis method utiUzes a novel chemical amplification process using a catalyst system 
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which is initiated by radiation to assist in the synthesis the polymer sequences. Methods of the 
present invention include the use of photosensitive compounds which act as catalysts to 
chemically alter the synthesis intermediates in a manner to promote formation of polymer 
sequences. Such photosensitive compounds include what are generally referred to as radiation- 
5 activated catalysts (RACs), and more specifically photo activated catalysts (PACs). The RACs 
may by themselves chemically alter the synthesis intermediate or they may activate an 
autocatalytic compound which chemically alters the synthesis intermediate in a manner to allow 
the synthesis intermediate to chemically combine with a later added synthesis intermediate or 
other compound. 

10 Arrays may also be synthesized in a combinatorial fashion by delivering monomers to 

cells of a support by mechanically constrained flowpaths. See Winkler et al., EP 624,059. Arrays 
may also be synthesized by spotting monomers reagents on to a support using an ink jet printer. 
See id. and Pease et al., EP 728,520. 

cDNA probes may be prepared according to methods known in the art and further 

1 5 described herein, e.g., reverse-transcription PGR (RT-PCR) of RNA using sequence specific 
primers. Oligonucleotide probes may be synthesized chemically. Sequences of the genes or 
cDNA from which probes are made may be obtained, e.g., from GenBank, other public databases 
or publications. 

Nucleic acid probes may be natural nucleic acids, chemically modified nucleic acids, e.g., 
20 composed of nucleotide analogs, as long as they have activated hydroxyl groups compatible with 
the linking chemistry. The protective groups can, themselves, be photolabile. Altematively, the 
protective groups may be labile under certain chemical conditions, e.g., acid. In this example, 
the surface of the solid support may contain a composition that generates acids upon exposure to 
light. Thus, exposiire of a region of the substrate to Ught generates acids in that region that 
25 remove the protective groups in the exposed region. Also, the synthesis method may use 3'- 
protected 5'-0-phosphoramidite-activated deoxynucleoside. In this case, the oligonucleotide is 
synthesized in the 5* to 3* direction, which results in a free 5* end. 

In one embodiment, oUgonucleotides of an array are synthesized using a 96 well 
automated multiplex oligonucleotide synthesizer (A.M.O.S.) that is capable of making thousands 
30 of oligonucleotides (Lashkari et al. (1995) PNAS 93: 7912) may be used. 
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It will be appreciated that oligonucleotide design is influenced by the intended 
appUcation. For example, it may be desirable to have similar melting temperatures for all of the 
probes. Accordingly, the length of the probes are adjusted so that the melting temperatures for all 
of the probes on the array are closely similar (it will be appreciated that different lengths for 
5 different probes may be needed to achieve a particular melting temperature where different 
probes have different GC contents). Although meUing temperature is a primary consideration in 
probe design, other factors are optionally used to further adjust probe construction, such as 
selecting against primer self-complementarity and the like. 

Arrays, e.g., microarrrays, may conveniently be stored following fabrication or purchase 

10 for use at a later time. Under appropriate conditions, the subject arrays are capable of being 
stored for at least about 6 months and may be stored for up to one year or longer. Arrays are 
generally stored at temperatures between about -20° C. to room temperature, where the arrays are 
preferably sealed in a plastic container, e.g. bag, and shielded from light. 
J, 1 Hybridization of the target nucleic acids to the microarray 

15 The next step is to contact the labeled nucleic acids with the array under conditions 

sufficient for binding between the probe and the target of the array. In a preferred embodiment, 
the probe will be contacted with the array under conditions sufficient for hybridization to occur 
between the labeled nucleic acids and probes on the microarray, where the hybridization 
conditions will be selected in order to provide for the desired level of hybridization specificity. 

20 Contact of the array and probe involves contacting the array with an aqueous medium 

comprising the probe. Contact may be achieved in a variety of different ways depending on 
specific configuration of the array. For example, where the array simply comprises the pattern of 
size separated targets on the surface of a "plate-like" rigid substrate, contact may be 
accomplished by simply placing the array in a container comprising the probe solution, such as a 

25 polyethylene bag, and the like. In other embodiments where the array is entrapped in a 

separation media bounded by two rigid plates, the opportunity exists to deliver the probe via 
electrophoretic means. Altematively, where the array is incorporated into a biochip device 
having fluid entry and exit ports, the probe solution may be introduced into the chamber in which 
the pattern of target molecules is presented through the entry port, where fluid introduction could 

30 be performed manually or with an automated device. In multiwell embodiments, the probe 
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solution will be introduced in the reaction chamber comprising the array, either manually, e.g. 
with a pipette, or with an automated fluid handUng device. 

Contact of the probe solution and the targets will be maintained for a sufficient period of 
time for binding between the probe and the target to occur. Although dependent on the nature of 
5 the probe and target, contact will generally be maintained for a period of time ranging from 

about 10 min to 24 hrs, usually from about 30 min to 12 hrs and more usually from about 1 hr to 
6hrs. 

When using commercially available microarrays, adequate hybridization conditions are 
provided by the manufacturer. When using non-commercial microarrays, adequate hybridization 

10 conditions may be determmed based on the following hybridization guidelines, as well as on the 
hybridization conditions described in the numerous pubUshed articles on the use of microarrays. 

Nucleic acid hybridization and wash conditions are optimally chosen so that the probe 
"specifically binds" or "specifically hybridizes" to a specific array site, i.e., the probe hybridizes, 
duplexes or binds to a sequence array site with a complementary nucleic acid sequence but does 

15 not hybridize to a site with a non-complementary nucleic acid sequence. As used herein, one 
polynucleotide sequence is considered complementary to another when, if the shorter of the 
polynucleotides is less than or equal to 25 bases, there are no mismatches using standard base- 
pairing rules or, if the shorter of the polynucleotides is longer than 25 bases, there is no more 
than a 5% mismatch. Preferably, the polynucleotides are perfectly complementary (no 

20 mismatches). It may easily be demonstrated that specific hybridization conditions result in 
specific hybridization by carrying out a hybridization assay including negative controls. 

Hybridization is carried out in conditions permitting essentially specific hybridization. 
The length of the probe and GC content will determine the Tm of the hybrid, and thus the 
hybridization conditions necessary for obtaining specific hybridization of the probe to the 

25 template nucleic acid. These factors are well known to a person of skill in the art, and may also 
be tested in assays. An extensive guide to the hybridization of nucleic acids is found in Tijssen 
(1993), Laboratory Techniques in biochemistry and molecular biology-hybridization with 
nucleic acid probes. Generally, stringent conditions are selected to be about 5°C lower than the 
thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm 

30 is the temperature (under defined ionic strength and pH) at which 50% of the target sequence 
hybridizes to a perfectly matched probe. Highly stringent conditions are selected to be equal to 
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the Tm point for a particular probe. Sometimes the term "Td** is used to define the temperature 
at which at least half of the probe dissociates fi'om a perfectly matched target nucleic acid. In any 
case, a variety of estimation techniques for estimating the Tm or Td are available, and generally 
described in Tijssen, supra. Typically, G-C base pairs in a duplex are estimated to contribute 
5 about 'i^C to the Tm, while A-T base pairs are estimated to contribute about 2°C, up to a 

theoretical maximum of about 80-1 OO^C. However, more sophisticated models of Tm and Td 
are available and appropriate in which G-C stacking interactions, solvent effects, the desired 
assay temperature and the like are taken into account. For example, probes may be designed to 
have a dissociation temperature (Td) of approximately 60°C, using the formula: Td = (((((3 x 

10 #GC) + (2 X #AT)) x 37) - 562)/#bp) - 5; where #GC, #AT, and #bp are the number of guanine- 
cytosine base pairs, the number of adenine-thymine base pairs, and the number of total base 
pairs, respectively, involved in the annealing of the probe to the template DNA. 

The stability difference between a perfectly matched duplex and a mismatched duplex, 
particularly if the mismatch is only a single base, may be quite small, corresponding to a 

15 difference in Tm between the two of as little as 0.5 degrees. See Tibanyenda, N. et al., Eur. J. 
Biochem. 139:19 (1984) and Ebel, S. et al, Biochem. 31:12083 (1992). More importantly, it is 
understood that as the length of the homology region increases, the effect of a single base 
mismatch on overall duplex stability decreases. 

Theory and practice of nucleic acid hybridization is described, e.g., in S. Agrawal (ed.) 

20 Methods in Molecular Biology^ volimie 20; and Tijssen (1993) Laboratory Techniques in 

biochemistry and molecular biology-hybridization with nucleic acid probes, e.g., part I chapter 2 
"Overview of principles of hybridization and the strategy of nucleic acid probe assays", Elsevier, 
New York provide a basic guide to nucleic acid hybridization. 

Certain microarrays are of "active" nature, i.e., they provide independent electronic 

25 control over all aspects of the hybridization reaction (or any other affinity reaction) occurring at 
each specific microlocation. These devices provide a new mechanism for affecting hybridization 
reactions which is called electronic stringency control (ESC). The active devices of this . 
invention may electronically produce "different stringency conditions" at each microlocation. 
Thus, all hybridizations may be carried out optimally in the same bulk solution. These arrays are 

30 described in U.S. Patent No. 6,051,380 by Sosnowski et al. 
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In a preferred embodiment, background signal is reduced by the use of a detergent (e.g, 
C-TAB) or a blocking reagent (e.g., sperm DNA, cot-1 DNA, etc.) during the hybridization to 
reduce non-specific binding. In a particularly preferred embodiment, the hybridization is 
performed in the presence of about 0.5 mg/ml DNA (e.g., herring sperm DNA). The use of 

5 blocking agents in hybridization is well known to those of skill in the art (see, e.g., Chapter 8 in 
Laboratory Techniques in Biochemistry and Molecular Biology, Vol. 24: Hybridization With 
Nucleic Acid Probes, P. Tijssen, ed. Elsevier, N.Y., (1993)). 

The method may or may not further comprise a non-boimd label removal step prior to the 
detection step, depending on the particular label employed on the target nucleic acid. For 

10 example, in certain assay formats (e.g., "homogenous assay formats") a detectable signal is only 
generated upon specific binding of target to probe. As such, in these assay formats, the 
hybridization pattem may be detected without a non-bound label removal step. In other 
embodiments, the label employed will generate a signal whether or not the target is specifically 
bound to its probe. In such embodiments, the non-bound labeled target is removed firom the 

15 support surface. One means of removing the non-bound labeled target is to perform the well 
known technique of washing, where a variety of wash solutions and protocols for their use in 
removing non-bound label are known to those of skill in the art and may be used. Altematively, 
non-bound labeled target may be removed by electrophoretic means. 

Where all of the target sequences are detected using the same label, different arrays will 

20 be employed for each physiological source (where different could include using the same array at 
different times). The above methods may be varied to provide for multiplex analysis, by 
employing different and distinguishable labels for the different target populations (representing 
each of the different physiological sources being assayed). According to this multiplex method, 
the same array is used at the same time for each of the different target populations. 

25 In another embodiment, hybridization is monitored in real time using a charge-coupled 

device imaging camera (Guschin et al. (1997) Anal Biochem. 250:203). Synthesis of arrays on 
optical fibre bundles allows easy and sensitive reading (Healy et al. (1997) Anal Biochem. 
251 :270). In another embodiment, real time hybridization detection is carried out on microarrays 
without washing using evanescent wave effect that excites only fluorophores that are boimd to 

30 the surface (see, e.g., Stimpson et al. (1995) PNAS 92:6379). 
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5.2. Detection of hybridization and analysis of results 

The above steps result in the production of hybridization patterns of labeled target nucleic 
acid on the array surface. The resultant hybridization patterns of labeled nucleic acids may be 
visualized or detected in a variety of ways, with the particular manner of detection being chosen 

5 based on the particular label of the target nucleic acid, where representative detection means 
include scintillation counting, autoradiography, fluorescence measurement, colorimetric 
measurement, light emission measurement, light scattering, and the like. 

One method of detection includes an array scanner that is commercially available from 
Affymetrix (Santa Clara, CA), e.g., the 417™ Arrayer, the 418™ Array Scanner, or the Agilent 

10 GeneArray Scanner. This scanner is controlled from the system computer with a Windows 
interface and easy-to-use software tools. The output is a 16-bit.tif file that may be directly 
imported into or directly read by a variety of software applications. Preferred scanning devices 
are described in, e.g., U.S. Pat. Nos. 5,143,854 and 5,424,186. 

When fluorescently labeled probes are used, the fluorescence emissions at each site of a 

15 transcript array may be, preferably, detected by scanning confocal laser microscopy. In one 

embodiment, a separate scan, using the appropriate excitation line, is carried out for each of the 
two fluorophores used. Alternatively, a laser may be used that allows simultaneous specimen 
illumination at wavelengths specific to the two fluorophores and emissions from the two 
fluorophores may be analyzed simultaneously {see Shalon et al, (1996), Genome Research 

20 6:639-645, which is incorporated by reference in its entirety for all purposes). In a preferred 
embodiment, the arrays are scanned with a laser fluorescent scanner with a computer controlled 
X-Y stage and a microscope objective. Sequential excitation of the two fluorophores may be 
achieved with a multi-line, mixed gas laser and the emitted light is split by wavelength and 
detected with two photomultiplier tubes. Fluorescence laser scanning devices are described in 

25 Schena et al., (1996), Genome Res. 6:639-645 and in other references cited herein. Altematively, 
the fiber-optic bundle described by Ferguson et al., (1996), Nature Biotech, 14: 1681-1684, may 
be used to monitor mRNA abimdance levels. 

In one embodiment in which fluorescent target nucleic acids are used, the arrays may be 
scanned using lasers to excite fluorescently labeled targets that have hybridized to regions of 

30 probe arrays, which may then be imaged using charged coupled devices ("CCDs") for a wide 
field scanning of the array. Altematively, another particularly usefiil method for gathering data 
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from the arrays is through the use of laser confocal microscopy which combines the ease and 
speed of a readily automated process with high resolution detection. 

Following the data gathering operation, the data will typically be reported to a data 
analysis operation. To facilitate the sample analysis operation, the data obtained by the reader 
5 from the device will typically be analyzed using a digital computer. Typically, the computer will 
be appropriately programmed for receipt and storage of the data from the device, as well as for 
analysis and reporting of the data gathered, e.g., subtraction of the background, deconvolution 
multi-color images, flagging or removing artifacts, verifying that controls have performed 
properly, normalizing the signals, interpreting fluorescence data to determine the amoxmt of 

10 hybridized target, normalization of background and single base mismatch hybridizations, and the 
like. In a preferred embodiment, a system comprises a search ftmction that allows one to search 
for specific pattems, e.g., pattems relating to differential gene expression, e.g., between the 
expression profile of a cell of a subject having a disorder of CD8+ T cell priming, or a disease in 
which CD8+ T cell priming is a component, and the expression profile of a counterpart normal 

15 cell in a subject. A system preferably allows one to search for pattems of gene expression 
between more than two samples. 

A desirable system for analyzing data is a general and flexible system for the 
visualization, manipulation, and analysis of gene expression data. Such a system preferably 
includes a graphical user interface for browsing and navigating through the expression data, 

20 allowing a user to selectively view and highUght the genes of interest. The system also 

preferably includes sort and search fimctions and is preferably available for general users with 
PC, Mac or Unix workstations. Also preferably included in the system are clustering algorithms 
that are qualitatively more efficient than existing ones. The accuracy of such algorithms is 
preferably hierarchically adjustable so that the level of detail of clustering may be systematically 

25 refined as desired. 

Various algorithms are available for analyzing the gene expression profile data, e.g., the 
type of comparisons to perform. In certain embodiments, it is desirable to group genes that are 
co-regulated. This allows the comparison of large numbers of profiles. A preferred embodiment 
for identifying such groups of genes involves clustering algorithms (for reviews of clustering 

30 algorithms, see, e.^., Fukunaga, 1990, Statistical Pattern Recognition, 2nd Ed,, Academic Press, 
San Diego; Everitt, 1974, Cluster Analysis, Lx)ndon: Heinemann Educ. Books; Hartigan, 1975, 
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Clustering Algorithms, New York: Wiley; Sneath and Sokal, 1973, Numerical Taxonomy , 
Freeman; Anderberg, 1973, Cluster Analysis for Applications y Academic Press: New York). 

Clustering analysis is useful in helping to reduce complex patterns of thousands of time 
curves into a smaller set of representative clusters. Some systems allow the clustering and 
5 viewing of genes based on sequences. Other systems allow clustering based on other 

characteristics of the genes, e.g., their level of expression (see, e.g., U.S. Patent No. 6,203,987). 
Other systems permit clustering of time ciuves (see, e.g. U.S. Patent No. 6,263,287). Cluster 
analysis may be performed using the hclust routine (see, e.g., "hclusf* routine from the software 
package S-Plus, MathSoft, Inc., Cambridge, Mass.). 

10 In some specific embodiments, genes are grouped according to the degree of co-variation 

of their transcription, presumably co-regulation, as described in U.S. Patent No. 6,203,987. 
Groups of genes that have co-varying transcripts are termed "genesets." Cluster analysis or other 
statistical classification methods may be used to analyze the co-variation of transcription of 
genes in response to a variety of perturbations, e.g. caused by a disease or a drug. In one specific 

15 embodiment, clustering algorithms are applied to expression profiles to construct a "similarity 
tree" or "clustering tree" which relates genes by the amount of co-regulation exhibited. Genesets 
are defined on the branches of a clustering tree by cutting across the clustering tree at different 
levels in the branching hierarchy. 

In some embodiments, a gene expression profile is converted to a projected gene 

20 expression profile. The projected gene expression profile is a collection of geneset expression 
values. The conversion is achieved, in some embodiments, by averaging the level of expression 
of the genes within each geneset. In some other embodiments, other linear projection processes 
may be used. The projection operation expresses the profile on a smaller and biologically more 
meaningful set of coordinates, reducing the effects of measurement errors by averaging them 

25 over each cellular constituent sets and aiding biological interpretation of the profile. 

6. Diagnostics and Prognostics 

In one embodiment, the invention provides diagnostic methods for monitoring the 
existence and/or evolution of disorders of CD8+ T cell priming or diseases in which CD8+ T cell 
30 priming is a component in a subject. For example, the invention provides methods for predicting 
whether a subject is likely to develop said disorder or disease; methods for confirming that a 
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subject, who has been diagnosed as having such a disorder or disease with traditional methods, 
has a certain disorder or disease, and not, e.g., a disease that is phenotypically related to it; and 
methods for monitoring the progression of the disease, e.g., in a subject imdergoing treatment. 
Preferred methods comprise evaluating the level of expression of one or more genes that are 
5 differentially expressed during CD8+ T cell priming in the CD8+ T cells of a subject who may 
have a disorder of CD8+ T cell priming or disease in which CD8+ T cell priming is a 
component. Other methods comprise evaluating the level of expression of tens, hundreds or 
thousands of genes that are differentially expressed during CD8+ T cell priming, e.g., by using 
microarray technology. The expression levels of the genes are then compared to the expression 

10 levels of the same genes one or more other cells, e.g., a cell from a normal or diseased subject. 
Comparison of the expression levels maybe performed visually. In a preferred 
embodiment, the comparison is performed by a computer. In one embodiment, expression levels 
of genes that are differentially expressed during CD8-I- T cell priming in cells of subjects who 
may have a disease or disorder are stored in a computer. The computer may optionally comprise 

15 expression levels of these genes in cells or nomial subjects. The data representing expression 
levels of the genes in a patient being diagnosed are then entered into the computer, and compared 
with one or more of the expression levels stored in the computer. The computer calculates 
differences and presents data showing the differences in expression of the genes in the two types 
of cells. 

20 In one embodiment, a cell sample from a subject is obtained from a caregiver, the level of 

expression of one or more genes whose expression is characteristic of a disorder of CD8+ T cell 
priming, or disease in which CD8-I- T cell priming is a component is determined, the expression 
data are entered into a computer comprising a plurality of reference expression data associated 
with particular therapies and compared thereto, to determine the most appropriate therapy for the 

25 patient. The method may further comprise sending, e.g., to the caregiver, the identity of the 

appropriate therapy. The data and identity of the appropriate therapy may be sent via a network, 
e.g., the internet. 

In other embodiments of the diagnostic methods contemplated by the present invention, 
the method of diagnosis comprises the steps of evaluating the activity of a protein encoded by a 
30 gene selected from the panels of the invention in the CD8+ T cells of a subject, and comparing 
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the activity of said protein in said subject's cells with that in a normal CD8+ T cell of the same 
type. In certain embodiments, a particular type of disorder or disease may be diagnosed if the 
protein whose activity is determined is associated with a particular type of disease or disorder. 

Exemplary diagnostic tools and assays are set forth below, imder (i) to (vi), followed by 
5 exemplary methods for conducting these assays. The assays may optionally utilize the 
microarrays of the invention. 

(i) In one embodiment, the invention provides a method for evaluating whether a 
subject has or is likely to develop a disorder of CD84- T cell priming or a disease in which CD8+ 
T cell priming is a component, comprising evaluating the level of expression of one or more 

10 genes which are up- or down-regulated during CD8+ T cell priming in a cell of the subject and 
comparing these levels of expression with the levels of expression of the genes in a diseased cell 
of a subject known to have a disorder of CD8+ T cell priming or a disease in which CD8+ T cell 
priming is a component, such that a similar level of expression of the genes is indicative that the 
subject has or is likely to develop a disorder of CD8-I- T cell priming or a disease in which CD8+ 

15 T cell priming is a component or at least a symptom thereof In a preferred embodiment, the cell 
is essentially of the same type as that which is diseased in the subject. 

(ii) In another embodiment the expression profiles of genes in the panels of the 
invention may be used to confirm that a subject has a specific disorder of CD8+ T cell priming or 
disease in which CD8+ T cell priming is a component, and in particular, that the subject does not 

20 have a related disease or disease with similar symptoms. This may be important, in particular, 
in designing an optimal therapeutic regimen for the subject. It has been described in the art that 
expression profiles may be used to distinguish one type of disease fi'om a similar disease. For 
example, two subtypes of non-Hodgkin's lymphomas, one of which responds to current 
therapeutic methods and the other one which does not, could be differentiated by investigating 

25 17,856 genes in specimens of patients suffering fi-om diffuse large B-cell lymphoma (AHzadeh et 
al. Nature (2000) 405:503). Similarly, subtypes of cutaneous melanoma were predicted based on 
profiling 8150 genes (Bittner et al. Nature (2000) 406:536). In this case, features of the highly 
aggressive metastatic melanomas could be recognized. Numeroxis other studies comparing 
expression profiles of diseased cells and normal cells have been described, including studies 

30 describing expression profiles distinguishing between highly and less metastatic cancers and 
studies describing new subtypes of diseases, e.g., new tumor types {see, e.g., Perou et al. (1999) 
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PNAS 96: 9212; Perou et al. (2000) Nature 606:747; Clark et al, (2000) Nature 406:532; Alon et 
al. (1999) PNAS 96:6745; Golub et al. (1999) Science 286:531). 

Accordingly, the expression profile of the invention allows the distinction of one disorder 
of CD8+ T cell priming, or disease in which CD8+ T cell priming is a component, firom related 
5 diseases. In a preferred embodiment, the level of expression of one or more genes that are 
differentially expressed during CD8+ T cell priming is determined in a cell of the subject.. In 
another embodiment, the level of expression of essentially all of the genes differentially 
expressed during CD8+ T cell priming is determined in a cell of the subject, such as by using a 
microarray comprising probes corresponding to all of or essentially all of the genes identified in 

1 0 FIGURE 1 . A level of expression of one or more genes characteristic of a disorder of CD8+ T 
cell priming or a disease in which CD8+ T cell priming is a component, and not of related 
diseases, that is similar to that in a cell of a subject with a disorder of CD8+ T cell priming or a 
disease in which CD8+ T cell priming is a component indicates that the subject has a disorder of 
CD8+ T cell priming or a disease in which CD8+ T cell priming is a component. 

15 Prior to using this method for evaluating whether the subject has a disorder of CD8+ T 

cell priming or a disease in which CD8+ T cell priming is a component, it may be necessary to 
first determine the expression profile of cells of diseases that are similar to a disorder of CD8+ T 
cell priming or a disease in which CD8-I- T cell priming is a component and cells fi-om numerous 
subjects having a disorder of CD8+ T cell priming or a disease in which CD8+ T cell priming is 

20 a component as diagnosed by traditional (i.e., non microarray based) methods. This may be 
undertaken using a microarray containing the panel of genes differentially expressed during 
CD8+ T cell priming according to methods further described herein. 

(iii) In yet another embodiment, the invention provides methods for evaluating the 
stage of a disorder of CD8+ T cell priming or a disease in which CD8+ T cell priming is a 

25 component in the subject. It is likely that the level of expression of the genes that are 

characteristic of a disorder of CD8+ T cell priming or a disease in which CD8+ T cell priming is 
a component changes with the stage of the disease. This could be confirmed, e.g., by analyzing 
the level of expression of these genes in subjects having a disorder of CD8+ T cell priming or a 
disease in which CD8+ T cell priming is a component at different stages, as determined by 

30 traditional methods. For example, the expression profile of a diseased cell in subjects at different 
stages of the disease may be determined as described herein. Then, to determine the stage of a 
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disorder of CD8+ T cell priming or a disease in which CD8-I- T cell priming is a component in a 
subject, the level of expression of one or more genes that are characteristic of the disorder and 
whose level of expression varies with the stage of the disease is determined. A similar level of 
expression of one or more genes whose expression is characteristic of a disorder of CD8+ T cell 
5 priming or a disease in which CD8+ T cell priming is a component between that in a subject and 
that in a reference profile of a particular stage of the disease, indicates that the disorder or disease 
of the subject is at the particular stage. 

(iv) Similarly, the method may be used to detemiine the stage of the disease in a subject 
undergoing therapy, and thereby determine whether the therapy is effective. Accordingly, in one 

10 embodiment, the level of expression of one or more genes involved in a disorder of CD8+ T cell 
priming or a disease in which CD8+ T cell priming is a component is determined in a subject 
before the treatment and several times during the treatment. For example, a sample of RNA may 
be obtained from the subject before the begiiming of the therapy and every 12, 24 or 72 hours 
during the therapy. Samples may also be analyzed one a week or once a month. Changes in 

1 5 expression levels of genes whose expression is characteristic of a disorder of CD8H- T cell 
priming or a disease in which CD8+ T cell priming is a component over time and relative to 
diseased cells and normal cells will indicate whether the therapy is effective. 

(v) In yet another embodiment, the invention provides a method for evaluating the 
likelihood of success of a particular therapy in a subject having a disorder of CD8+ T cell 

20 priming or a disease in which CD8+ T cell priming is a component. In one embodiment, a 

subject is started on a particular therapy, and the effectiveness of the therapy is determined, e.g., 
by evaluating the level of expression of one or more genes whose expression is characteristic of a 
disorder of CD8+ T cell priming or a disease in which CD8+ T cell priming is a component in a 
cell of the subject. A normalization of the level of expression of these genes, i.e., a change in the 

25 expression level of the genes such that their level of expression resembles more that of a cell of a 
normal subject, indicates that the treatment should be effective in the subject. On the other hand, 
the absence of normalization of the level of expression of the genes involved in a disorder of 
CD8+ T cell priming or a disease in which CD8+ T cell priming is a component indicates that 
the treatment is not likely to be effective in the subject. 

30 Prediction of the outcome of a treatment of a disorder of CD8+ T cell priming or a 

disease in which CD8+ T cell priming is a component in a subject may also be undertaken in 
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vitro. In one embodiment, cells are obtained from a subject to be evaluated for responsiveness to 
the treatment, and incubated in vitro with the therapeutic drug. The level of expression of one or 
more genes involved in CD8+ T cell priming is then measured in the cells and these values are 
compared to the level of expression of these one or more genes in a cell which is the normal 

5 counterpart cell of a cell taken from a patient with a disease or disorder. The level of expression 
may also be compared to that in a normal cell. In a preferred embodiment, the level of 
expression of essentially all the genes that are differentially regulated during CD8+ T cell 
priming, i.e., the genes shown in FIGURE 1, is determined. The comparative analysis is 
preferably conducted using a computer comprising a database comprising the level of expression 

10 of at least one gene differentially expressed during CD8+ T cell priming in a cell taken from a 
diseased and/or normal individual. A level of expression of one or more genes that are 
differentially expressed during CD8+ T cell priming in the cells of the subject after incubation 
with the drug that is similar to their level of expression in cell from a normal subject and 
different from a cell taken from a diseased subject is indicative that it is likely that the subject 

15 will respond positively to a treatment with the drug. On the contrary, a level of expression of 
one or more genes whose expression that are differentially expressed during CD8+ T cell 
priming in the cells of the subject after incubation with the drug that is similar to their level of 
expression in cell from a diseased subject and different from a cell taken from a normal subject is 
indicative that it is likely that the subject will not respond positively to a treatment with the drug. 

20 Since it is possible that a drug for treating a disorder of CD8+ T cell priming or a disease 

in which CD8+ T cell priming is a component does not act directly on the CD8+ T cells, but is, 
e.g., metabolized, or acts on another cell which then secretes a factor that will affect the CD8+ T 
cells, the above assay may also be conducted in a tissue sample of a subject, which contains cells 
other than the CD8+ T cells. For example, a blood sample comprising CD8+ T cells is obtained 

25 from a subject; the sample is incubated with the potential drug; optionally one or more CD8+ T 
cells are isolated from the sample, and the expression level of one or more genes that are 
differentially expressed during CD8+ T cell priming is examined. 

(vi) The invention may also provide methods for selecting a therapy for a disorder of 
CD8+ T cell priming or a disease in which CD8+ T cell priming is a component for a patient 

30 from a selection of several different treatments. Certain subjects having a disorder of CD8+ T 
cell priming or a disease in which CD8+ T cell priming is a component may respond better to 
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one type of therapy than another type of therapy. In a preferred embodiment, the method 
comprises comparing the expression level of at least one gene characteristic of a disorder of 
CD8+ T cell priming or a disease in which CD8+ T cell priming is a component in the patient 
with that in cells of subjects treated in vitro or in vivo with one of several therapeutic drugs, 

5 which subjects are responders or non responders to one of the therapeutic drugs, and identifying 
the cell which has the most similar level of expression of the one or more genes to that of the 
patient, to thereby identify a therapy for the patient. The method may further comprise 
administering the therapy identified to the subject. 

A person of skill in the art will recognize that in certain diagnostic and prognostic assays, 

10 it will be sufficient to assess the level of expression of a single gene that is differentially 
expressed during CD8+ T cell priming and that in others, the expression of two or more is 
preferred, whereas still in others, the expression of essentially all the genes involved in CD8+ T 
cell priming is preferably assessed. 

Set forth below are exemplary methods which may be used to determine the level of 

15 expression of one or more genes differentially expressed dxuing CD8+ T cell priming, e.g., for 
use in the above-described methods. For example, the level of expression of a gene may be 
determined by reverse transcription-polymerase chain reaction (RT-PCR); dotblot analysis; 
Northem blot analysis and in situ hybridization. In a preferred embodiment, the level of 
expression is determined by using a microarray which contains probes of the genes that are up- 

20 or down-regulated during CD8+ T cell priming. In another embodiment, the level of protein 

encoded by one or more of the genes that are up- or down-regulated during CD8+ T cell priming 
is determined in a cell taken fi-om a subject having a disorder of CD8+ T cell priming or a 
disease in which CD8+ T cell priming is a component. This may be done by a variety of 
methods, e.g., immunohistochemistry. 

25 6.1. Use of microarray s for evaluating the level of expression of genes that are 

differentially regulated during CD8^ Tcell priming 

Generally, evaluating expression profiles with microarrays involves the following steps: 
(a) obtaining a mRNA sample fi*om a subject and preparing labeled nucleic acids therefirom (the 
"target nucleic acids" or '^targets"); (b) contact of the target nucleic acids with the array under 

30 conditions sufficient for target nucleic acids to bind with corresponding probe on the array, e.g. 
by hybridization or specific binding; (c) optional removal of unbound targets fi"om the array; and 
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(d) detection of bound targets, and analysis of the results, e.g., using computer based analysis 
methods. "Nucleic acid probes" or "probes" are nucleic acids attached to the array, whereas 
"target nucleic acids" are nucleic acids that are hybridized to the array. Each of these steps is 
described in more detail below. 
5 (i) Obtaining a mRNA sample of a subject 

Nucleic acid specimens may be obtained from an individual to be tested using either 
"invasive" or "non-invasive" sampling means. A sampling means is said to be "invasive" if it 
involves the collection of nucleic acids from within the skin or organs of an animal (including, 
especially, a murine, a human, an ovine, an equine, a bovine, a porcine, a canine, or a feline 

10 animal). Examples of invasive methods include blood collection, semen collection, needle 

biopsy, pleural aspiration, umbilical cord biopsy, etc. Examples of such methods are discussed 
by Kim, C. H. et al. (J. Virol 66:3879-3882 (1992)); Biswas, B. et al. {Annals NY Acad. Sci. 
590:582-583 (1990)); Biswas, B. et al. (7. Clin, Microbiol 29:2228-2233 (1991)). 

In one embodiment, one or more cells from the subject to be tested are obtained and RNA 

15 is isolated from the cells. In a preferred embodiment, a sample of T cells is obtained from the 
subject. When obtaining the cells, it is preferable to obtain a sample containing predominantly 
cells of the desired type, e.g., a sample of cells in which at least about 50%, preferably at least 
about 60%, even more preferably at least about 70%, 80% and even more preferably, at least 
about 90% of the cells are of the desired type. A higher percentage of cells of the desired type is 

20 preferable, since such a sample is more likely to provide clear gene expression data. Blood 
samples may be obtained according to methods known in the art. 

It is also possible to obtain a cell sample from a subject, and then to enrich it in the 
desired cell type. For example, cells may be isolated from other cells using a variety of 
techniques, such as isolation with an antibody binding to an epitope on the cell surface of the 

25 desired cell type. 

In one embodiment, RNA is obtained from a single cell. It is also possible to obtain cells 
from a subject and culture the cells in vitro, such as to obtain a larger population of cells from 
which RNA may be extracted. Methods for establishing cultures of non-transformed cells, i.e., 
primary cell cultures, are known in the art. 

30 When isolating RNA from tissue samples or cells from individuals, it may be important 

to prevent any ftirther changes in gene expression after the tissue or cells has been rmioved from 
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the subject. Changes in expression levels are known to change rapidly following perturbations, 
e.g., heat shock or activation with lipopolysaccharide (LPS) or other reagents. In addition, the 
RNA in the tissue and cells may quickly become degraded. Accordingly, in a preferred 
embodiment, the cells obtained from a subject are snap frozen as soon as possible. 

5 RNA may be extracted from the tissue sample by a variety of methods, e.g., the 

guanidium thiocyanate lysis followed by CsCl centrifiigation (Chirgwin et al., 1979, 
Biochemistry 18:5294-5299). RNA from single cells may be obtained as described in methods 
for preparing cDNA libraries from single cells, such as those described in Dulac, C. (1998) Curr, 
Top. Dev. Biol 36, 245 and Jena et al. (1996) J. Immunol Methods 190:199. Care to avoid RNA 

1 0 degradation must be taken, e.g., by inclusion of RNAsin. 

The RNA sample may then be enriched in particular species. In one embodiment, 
poly(A)-f- RNA is isolated from the RNA sample. In general, such purification takes advantage 
of the poly- A tails on mRNA. In particular and as noted above, poly-T oUgonucleotides may be 
immobilized on a soUd support to serve as affinity ligands for mRNA. Kits for this purpose are 

15 conmiercially available, e.g., the MessageMaker kit (Life Technologies, Grand Island, NY). 

In a preferred embodiment, the RNA population is enriched in sequences of interest, such 
as those of the genes differentially expressed in CD8+ T cells. Enrichment may be undertaken, 
e.g., by primer-specific cDNA synthesis, or multiple rounds of linear amplification based on 
cDNA synthesis and template-directed in vitro transcription {see, e.g., Wang et al. (1989) PNAS 

20 86, 9717; Dulac et al., supra, and Jena et al., supra). 

The population of RNA, enriched or not in particular species or sequences, may fiuther 
be amplified. Such amplification is particularly important when using RNA from a single or a 
few cells. A variety of amplification methods are suitable for use in the methods of the 
invention, including, e.g., PCR; Ugase chain reaction (LCR) {see, e.g., Wu and Wallace, 

25 Genomics 4, 560 (1989), Landegren et al.. Science 241, 1077 (1988)); self-sustained sequence 
replication (SSR) {see, e.g., GuateUi et al., PNAS, 87, 1874 (1990)); nucleic acid based sequence 
amplification (NASBA) and transcription ampUfication {see, e.g., Kwoh et al., PNAS 86, 1 173 
(1989)). For PCR technology, see, e.g., PCR Technology: Principles and Applications forDNA 
Amplification (ed. H. A. Erlich, Freeman Press, N.Y., N.Y., 1992); PCR Protocols: A Guide to 

30 Methods and applications (eds. Innis, et al.. Academic Press, San Diego, Calif, 1990); Mattila et 
al.. Nucleic Acids Res. 19, 4967 (1991); Eckert et al., PCR Methods and Applications 1, 17 
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(1991); PCR (eds. McPherson et al., IRL Press, Oxford); and U.S. Pat. No. 4,683,202. Methods 
of amplification are described, e.g., in Ohyama et al. (2000) BioTechniques 29:530; Luo et al. 
(1999) Nat Med, 5, 1 17; Hegde et al. (2000) BioTechniques 29:548; Kacharmina et al. (1999) 
Meth. Enzymol 303:3; Livesey et al (2000) Curr. Biol 10:301; Spirin et al. (1999) Invest. 
5 Ophtalmol Vis, ScL 40:3108; and Sakai et al. (2000) Anal Biochem. RNA amplification 

and cDNA synthesis may also be conducted in cells in situ (see, e.g,, Eberwine et al. (1992) 
PNAS S9:3010). 

One of skill in the art will appreciate that whatever amplification method is used, if a 
quantitative result is desired, care must be taken to use a method that maintains or controls for 

10 the relative frequencies of the amplified nucleic acids to achieve quantitative amplification. 

Methods of "quantitative" amplification are well known to those of skill in the art. For example, 
quantitative PCR involves simultaneously co-amplifying a known quantity of a control sequence 
using the same primers. This provides an internal standard that may be used to calibrate the PCR 
reaction. A high density array may then include probes specific to the intemal standard for 

1 5 quantification of the amplified nucleic acid. 

One preferred intemal standard is a synthetic AW106 RNA. The AW106 RNA is 
combined with RNA isolated from the sample according to standard techniques known to those 
of skilled in the art. The RNA is then reverse transcribed using a reverse transcriptase to provide 
copy DNA. The cDNA sequences are then amplified (e.g., by PCR) using labeled primers. The 

20 amplification products are separated, typically by electrophoresis, and the amount of 

radioactivity (proportional to the amount of amplified product) is determined. The amount of 
mRNA in the sample is then calculated by comparison with the signal produced by the known 
AW106 RNA standard. Detailed protocols for quantitative PCR are provided in PCR Protocols, 
A Guide to Methods and Applications, Innis et al., Academic Press, Inc. N.Y., (1990). 

25 In a preferred embodiment, a sample mRNA is reverse transcribed with a reverse 

transcriptase and a primer consisting of oligo(dT) and a sequence encoding the phage T7 
promoter to provide single stranded DNA template. The second DNA strand is polymerized 
using a DNA polymerase. After synthesis of double-stranded cDNA, T7 RNA polymerase is 
added and RNA is transcribed from the cDNA template. Successive rounds of transcription from 

30 each single cDNA template results in amplified RNA. Methods of in vitro polymerization are 
well known to those of skill in the art (see, e.g., Sambrook, (supra) and this particular method is 
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described in detail by Van Gelder, et al., PNAS, 87: 1663-1667 (1990) who demonstrate that in 
vitro amplification according to this method preserves the relative frequencies of the various 
RNA transcripts. Moreover, Eberwine et al., PNAS, 89: 3010-3014 provide a protocol that uses 
two rounds of amplification via in vitro transcription to achieve greater than 10^ fold 

5 amplification of the original starting material, thereby permitting expression monitoring even 
where biological samples are limited. 

It will be appreciated by one of skill in the art that the direct transcription method 
described above provides an antisense (aRNA) pool. Where antisense RNA is used as the target 
nucleic acid, the oUgonucleotide probes provided in the array are chosen to be complementary to 

10 subsequences of the antisense nucleic acids. Conversely, where the target nucleic acid pool is a 
pool of sense nucleic acids, the oligonucleotide probes are selected to be complementary to 
subsequences of the sense nucleic acids. Finally, where the nucleic acid pool is double stranded, 
the probes may be of either sense as the target nucleic acids include both sense and antisense 
strands. 

1 5 (ii) Labeling of the nucleic acids to be analyzed 

Generally, the target molecules will be labeled to permit detection of hybridization of 
target molecules to a microarray. By labeled is meant that the probe comprises a member of a 
signal producing system and is thus detectable, either directly or through combined action with 
one or more additional members of a signal producing system. Examples of directly detectable 

20 labels include isotopic and fluorescent moieties incorporated into, usually covalently bonded to, 
a moiety of the probe, such as a nucleotide monomeric unit, e.g. dNMP of the primer, or a 
photoactive or chemically active derivative of a detectable label which may be bound to a 
functional moiety of the probe molecule. 

Nucleic acids may be labeled after or during enrichment and/or ampUfication of RNAs. 

25 For example, labeled cDNA is prepared from mRNA by oligo dT-primed or random-primed 
reverse transcription, both of which are well known in the art (see, e.g., Klug and Berger, 1987, 
Methods EnzymoL 152:316-325). Reverse transcription may be carried out in the presence of a 
dNTP conjugated to a detectable label, most preferably a fluorescently labeled dNTP. 
Alternatively, isolated mRNA may be converted to labeled antisense RNA synthesized by in 

30 vitro transcription of double-stranded cDNA in the presence of labeled dNTPs (Lockhart et al., 
(1996), Nature Biotech, 14:1675, which is incorporated by reference in its entirety for all 
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purposes). In alternative embodiments, the cDNA or RNA probe may be synthesized in the 
absence of detectable label and may be labeled subsequently, e.g., by incorporating biotinylated 
dNTPs or rNTP, or some similar means (e.g., photo-cross-linking a psoralen derivative of biotin 
to RNAs), followed by addition of labeled streptavidin (e.g., phycoerythrin-conjugated 

5 streptavidin) or the equivalent. 

In one embodiment, labeled cDNA is synthesized by incubating a mixture containing 0.5 
mM dGTP, dATP and dCTP plus 0.1 mM dTTP plus fluorescent deoxyribonucleotides (e.g., 0.1 
mM Rhodamine 1 10 TTP (Perkin Ehner Cetus) or 0.1 mM Cy3 dTTP (Amersham)) with reverse 
transcriptase (e.g., SuperScript.™.II, LTI Inc.) at 42 °C for 60 min. 

10 Fluorescent moieties or labels of interest include coxmiarin and its derivatives, e.g. 7- 

amino-4-methylcoumarin, aminocoumarin, bodipy dyes, such as Bodipy FL, cascade blue, 
fluorescein and its derivatives, e.g. fluorescein isothiocyanate, Oregon green, rhodamine dyes, 
e.g. Texas red, tetramethylrhodamine, eosins and erythrosins, cyanine dyes, e.g. Cy2, Cy3, 
Cy3.5, Cy5, Cy5.5, Cy7, FluorX, macrocyclic chelates of lanthanide ions, e.g. quantum dye™, 

15 fluorescent energy transfer dyes, such as thiazole orange-ethidium heterodimer, TOTAB, dansyl, 
etc. Individual fluorescent compounds which have functionalities for linking to an element 
desirably detected in an apparatus or assay of the invention, or which may be modified to 
incorporate such functionalities include, e.g., dansyl chloride; fluoresceins such as 3,6- 
dihydroxy-9-phenylxanthydrol; rhodamineisothiocyanate; N-phenyl l-amino-8- 

20 sulfonatonaphthalene; N-phenyl 2-amino-6-sulfonatonaphthalene; 4-acetamido-4-isothiocyanato- 
stilbene-2,2 -disulfonic acid; pyrene-3-sulfonic acid; 2-toluidinonaphthalene-6-sulfonate; N- 
phenyl-N-methyl-2-aminoaphthalene-6-sulfonate; ethidium bromide; stebrine; auromine-0,2-(9 - 
anthroyl)palmitate; dansyl phosphatidylethanolamine; HN^-dioctadecyl oxacarbocyanine: N,N'- 
dihexyl oxacarbocyanine; merocyanine, 4-(3'-pyrenyl)stearate; d-3-aminodesoxy-equilenin; 12- 

25 (9 -anthroyl)stearate; 2-methylanthracene; 9-vinylanthracene; 2,2*(vinylene-p- 

phenylene)bisbenzoxazoIe; p-bis(2- -methyl-5-phenyl-oxazolyl))benzene; 6-dimethylamino-l,2- 
benzophenazin; retinol; bis(3*-aminopyridinium) 1,10-decandiyl diiodide; 
sulfonaphthylhydrazone of hellibrienin; chlorotetracycline; N-(7-dimethylamino-4-methyl-2- 
oxo-3-chromenyl)maleimide; N-(p-(2benzimidazolyl)-phenyl)maleimide; N-(4- 

30 fluoranthyl)maleimide; bis(homovanillic acid); resazarin; 4-chloro-7-nitro-2,l,3- 

benzooxadiazole; merocyanine 540; resonifin; rose bengal; and 2,4-diphenyl-3(2H)-furanone. 
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(see, e.g.y Kricka, 1992, Nonisotopic DNA Probe Techniques, Academic Press San Diego, 
Calif). Many fluorescent tags are commercially available from SIGMA chemical company 
(Saint Louis, Mo.), Amersham, Molecular Probes, R&D systems (Minneapolis, Minn.), 
Pharmacia LKB Biotechnology (Piscataway, N J.), CLONTECH Laboratories, Inc. (Palo Alto, 
5 Calif.), Chem Genes Corp., Aldrich Chemical Company (Milwaukee, Wis,), Glen Research, Inc., 
GIBCO BRL Life Technologies, Inc. (Gaithersburg, Md.), Fluka Chemica-Biochemika 
Analytika (Fluka Chemie AG, Buchs, Switzerland), and Applied Biosystems (Foster City, Calif.) 
as well as other commercial sources known to one of skill. 

Chemiluminescent labels include luciferin and 2,3-dihydrophthalazinediones, e.g., 
10 luminol. 

Isotopic moieties or labels of interest include ^^P, ^^S, ^^^I, ^H, ^"^C, and the like (see 
Zhao et al., (1995) Gene 156:207; Pietu et al., (1996) Genome Res. 6:492). However, because of 
scattering of radioactive particles, and the consequent requirement for widely spaced binding 
sites, use of radioisotopes is a less-preferred embodiment. 

15 Labels may also be members of a signal producing system that act in concert with one or 

more additional members of the same system to provide a detectable signal. Illustrative of such 
labels are members of a specific binding pair, such as ligands, e.g. biotin, fluorescein, 
digoxigenin, antigen, polyvalent cations, chelator groups and the like, where the members 
specifically bind to additional members of the signal producing system, where the additional 

20 members provide a detectable signal either directly or indirectly, e.g. antibody conjugated to a 
fluorescent moiety or an enzymatic moiety capable of converting a substrate to a chromogenic 
product, e.g. alkaline phosphatase conjugate antibody and the like. 

Additional labels of interest include those that provide for signal only when the probe 
with which they are associated is specifically bound to a target molecule, where such labels 

25 include: "molecular beacons" as described in Tyagi & Kramer, Nature Biotechnology (1996) 
14:303 and EP 0 070 685 BL Other labels of interest include those described in U.S. Pat. No. 
5,563,037; WO 97/17471 and WO 97/17076. 

In some cases, hybridized target nucleic acids may be labeled following hybridization. 
For example, where biotin labeled dNTPs are used in, e.g., amplification or transcription, 

30 streptavidin linked reporter groups may be used to label hybridized complexes. 
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In other embodiments, the target nucleic acid is not labeled. In this case, hybridization 
may be determined, e.g., by plasmon resonance, as described, e.g., in Thiel et al. (1997) Anal. 
Chem. 69:4948, 

In one embodiment, a plurality (e.g., 2, 3, 4, 5 or more) of sets of target nucleic acids are 
5 labeled and used in one hybridization reaction ("multiplex" analysis). For example, one set of 
nucleic acids may correspond to RNA from one cell and another set of nucleic acids may 
correspond to RNA from another cell. The plurality of sets of nucleic acids may be labeled with 
different labels, e.g., different fluorescent labels which have distinct emission spectra so that they 
may be distinguished. The sets may then be mixed and hybridized simultaneously to one 
10 microarray. 

For example, the two different cells may be a CD8-I- T cell taken from a diseased subject 
and a cell taken from a normal subject. Alternatively, the two different cells may be a CD8+ T 
cell of a patient having a disorder of CD8+ T cell priming or a disease in which CD8+ T cell 
priming is a component, and a CD8+ T cell of a patient suspected of having a disorder of CD8+ 

15 T cell priming or a disease in which CD8+ T cell priming is a component. In another 

embodiment, one biological sample is exposed to a drug and another biological sample of the 
same type is not exposed to the drug. The cDNA derived from each of the two cell types are 
differently labeled so that they may be distinguished. In one embodiment, for example, cDNA 
from a cell taken from a diseased subject is synthesized using a fluorescein-labeled dNTP, and 

20 cDNA from a second cell, i.e., the cell taken from a normal subject, is synthesized using a 
rhodamine-labeled dNTP. When the two cDNAs are mixed and hybridized to the microarray, 
the relative intensity of signal from each cDNA set is determined for each site on the array, and 
any relative difference in abundance of a particular mRNA detected. 

In the example described above, the cDNA from the CD8+ T cell taken from a diseased 

25 subject will fluoresce green when the fluorophore is stimulated, and the cDNA from the cell of a 
subject suspected of having a disorder of CD8+ T cell priming or a disease in which CD8+ T cell 
priming is a component will fluoresce red. As a result, if the two cells are essentially the same, 
the particular mRNA will be equally prevalent in both cells and, upon reverse transcription, red- 
labeled and green-labeled cDNA will be equally prevalent. When hybridized to the microarray, 

30 the binding site(s) for that species of RNA will emit wavelengths characteristic of both 
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fluorophores (and appear brown in combination). In contrast, if the two cells are different, the 
ratio of green to red fluorescence will be different. 

The use of a two-color fluorescence labeling and detection scheme to define alterations in 
gene expression has been described, e.g., in Shena et al., (1995) Science 270:467-470. An 

5 advantage of using cDNA labeled with two different fluorophores is that a direct and internally 
controlled comparison of the mRNA levels corresponding to each arrayed gene in two cell states 
may be made, and variations due to minor differences in experimental conditions (e.g, 
hybridization conditions) will not affect subsequent analyses. 

Examples of distinguishable labels for use when hybridizing a plurality of target nucleic 

10 acids to one array are well known in the art and include: two or more different emission 

wavelength fluorescent dyes, like Cy3 and Cy5, combination of fluorescent proteins and dyes, 
like phycoerythrin and Cy5, two or more isotopes with different energy of emission, like ^^P and 
^^P, gold or silver particles with different scattering spectra, labels which generate signals under 
different treatment conditions, like temperature, pH, treatment by additional chemical agents, 

15 etc., or generate signals at different time points after treatment. Using one or more enzymes for 
signal generation allows for the use of an even greater variety of distinguishable labels, based on 
different substrate specificity of enzymes (alkaline phosphatase/peroxidase). 

Further, it is preferable in order to reduce experimental error to reverse the fluorescent 
labels in two-color differential hybridization experiments to reduce biases peculiar to individual 

20 genes or array spot locations. In other words, it is preferable to first measure gene expression 
with one labeling (e.g., labeling nucleic acid firom a first cell with a first fluorochrome and 
nucleic acid fi-om a second cell with a second fluorochrome) of the mRNA firom the two cells 
being measured, and then to measure gene expression fi-om the two cells with reversed labeling 
(e.g., labeling nucleic acid fi-om the first cell with the second fluorochrome and nucleic acid from 

25 the second cell with the first fluorochrome). Multiple measurements over exposure levels and 
perturbation control parameter levels provide additional experimental error control. 

The quality of labeled nucleic acids may be evaluated prior to hybridization to an array. 
For example, a sample of the labeled nucleic acids may be hybridized to probes derived firom the 
5', middle and 3' portions of genes known to be or suspected to be present in the nucleic acid 

30 sample. This will be indicative as to whether the labeled nucleic acids are fiiU length nucleic 
acids or whether they are degraded. In one embodiment, the GeneChip® Test3 Array firom 
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AfFymetrix (Santa Clara, CA) may be used for that purpose. This array contains probes 
representing a subset of characterized genes from several organisms including mammals. Thus, 
the quality of a labeled nucleic acid sample may be determined by hybridization of a fraction of 
the sample to an array, such as the GeneChip® Test3 Array from Affymetrix (Santa Clara, CA). 

5 6.2. Other methods for evaluating gene expression levels 

In certain embodiments, it is sufficient to determine the expression of one or only a few 
genes, as opposed to hundreds or thousands of genes. Although microarrays may be used in 
these embodiments, various other methods of detection of gene expression are available. This 
section describes a few exemplary methods for detecting and quantifying mRNA or polypeptide 

10 encoded thereby. Where the first step of the methods includes isolation of mRNA from cells, 
this step may be conducted as described above. Labeling of one or more nucleic acids may be 
performed as described above. 

In one embodiment, mRNA obtained form a sample is reverse transcribed into a first 
cDNA strand and subjected to PGR, e.g., RT-PCR. House keeping genes, or other genes whose 

1 5 expression does not vary may be used as internal controls and controls across experiments. 
Following the PCR reaction, the amplified products may be separated by electrophoresis and 
detected. By using quantitative PCR, the level of amplified product will correlate with the level 
of RNA that was present in the sample. The ampUfied samples may also be separated on a 
agarose or polyacrylamide gel, transferred onto a filter, and the filter hybridized with a probe 

20 specific for the gene of interest. Numerous samples may be analyzed simultaneously by 
conducting parallel PCR ampUfication, e.g., by multiplex PCR. 

In another embodiment, mRNA levels is determined by dotblot analysis and related 
methods (see, G. A. Beltz et al., in Methods in Enzymology, Vol. 100, Part B, R. Wu, L. 
Grossmam, K. Moldave, Eds., Academic Press, New York, Chapter 19, pp. 266-308, 1985). In 

25 one embodiment, a specified amount of RNA extracted fi-om cells is blotted (i.e., non-covalently 
bound) onto a fiher, and the filter is hybridized with a probe of the gene of interest. Numerous 
RNA samples may be analyzed simultaneously, since a blot may comprise multiple spots of 
RNA. Hybridization is detected using a method that depends on the type of label of the probe. 
In another dotblot method, one or more probes of one or more genes that are differentially 

30 expressed during CD8+ T cell priming are attached to a membrane, and the membrane is 

incubated with labeled nucleic acids obtained fi"om and optionally derived fi'om RNA of a cell or 
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tissue of a subject. Such a dotblot is essentially an array comprising fewer probes than a 
microarray. 

"Dot blot" hybridization gained wide-spread use, and many versions were developed 

{see, e.g,y M, L. M. Anderson and B. D. Young, in Nucleic Acid Hybridization-A Practical 
5 Approach, B. D. Hames and S. J. Higgins, Eds., IRL Press, Washington D.C., Chapter 4, pp. 73- 
111,1985). 

Another format, the so-called "sandwich" hybridization, involves covalently attaching 
oligonucleotide probes to a solid support and using them to capture and detect multiple nucleic 
acid targets {see, e.g., M. Ranki et al., Gene, 21, pp. 77-85, 1983; A. M. Palva, T. M. Ranki, and 

10 H. E. Soderlund, in UK Patent Application GB 2156074A, Oct. 2, 1985; T. M. Ranki and H. E. 
Soderlund in U.S. Pat. No. 4,563,419, Jan. 7, 1986; A. D. B. Malcolm and J. A. Langdale, in 
PCT WO 86/03782, Jul. 3, 1986; Y. Stabinsky, in U.S. Pat. No. 4,751,177, Jan. 14, 1988; T. H. 
Adams et al., in PCT WO 90/01564, Feb. 22, 1990; R. B. Wallace et al. 6 Nucleic Acid Res. 11, 
p. 3543, 1979; and B. J. Connor et al., 80 PNASpp. 278-282, 1983). Multiplex versions of these 

1 5 formats are called "reverse dot blots.*' 

mRNA levels may also be determined by Northern blots. Specific amounts of RNA are 
separated by gel electrophoresis and transferred onto a filter which is then hybridized with a 
probe corresponding to the gene of interest. This method, although more burdensome when 
nimierous samples and genes are to be analyzed provides the advantage of being very accurate. 

20 A preferred method for high throughput analysis of gene expression is the serial analysis 

of gene expression (SAGE) technique, first described in Velculescu et al. (1995) Science 270, 
484-487. Among the advantages of SAGE is that it has the potential to provide detection of all 
genes expressed in a given cell type, provides quantitative information about the relative 
expression of such genes, permits ready comparison of gene expression of genes in two cells, and 

25 yields sequence information that may be used to identify the detected genes. Thus far, SAGE 
methodology has proved itself to reliably detect expression of regulated and nonregulated genes 
m a variety of cell types (Velculescu et al. (1997) Cell 88, 243-251; Zhang et al. (1997) Science 
276, 1268-1272 and Velculescu et al. (1999) Nat. Genet 23, 387-388. 

Techniques for producing and probing nucleic acids are fiirther described, for example, in 

30 Sambrook et al, "Molecular Cloning: A Laboratory Manual" (New York, Cold Spring Harbor 
Laboratory, 1989). 
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Alternatively, the level of expression of one or more genes that are differentially 
expressed during CD8+ T cell priming is determined by in situ hybridization. In one 
embodiment, a tissue sample is obtained from a subject, the tissue sample is sliced, and in situ 
hybridization is performed according to methods known in the art, to determine the level of 

5 expression of the genes of interest. 

Li other methods, the level of expression of a gene is detected by measuring the level of 
protein encoded by the gene. This may be done, e.g., by immunoprecipitation, ELISA, or 
immunohistochemistry using an agent, e.g., an antibody, that specifically detects the protein 
encoded by the gene. Other techniques include Westem blot analysis. Immunoassays are 

10 commonly used to quantitate the levels of proteins in cell samples, and many other immimoassay 
techniques are known in the art. The invention is not limited to a particular assay procedure, and 
therefore is intended to include both homogeneous and heterogeneous procedures. Exemplary 
immunoassays which may be conducted according to the invention include fluorescence 
polarization immunoassay (FPIA), fluorescence immunoassay (FIA), enzyme immunoassay 

15 (EIA), nephelometric inhibition immunoassay (NIA), enzyme linked immunosorbent assay 

(ELISA), and radioimmunoassay (RIA). An indicator moiety, or label group, may be attached to 
the subject antibodies and is selected so as to meet the needs of various uses of the method which 
are often dictated by the availabiUty of assay equipment and compatible immunoassay 
procedures. General techniques to be used in performing the various immunoassays noted above 

20 are known to those of ordinary skill in the art. 

In still other methods, the level of expression of a gene is detected by measuring the level 
of protein by gel electrophoresis. A preferred method for determining the level of protein 
comprises subjecting the sample containing the proteins to be analyzed to gel electrophoresis, 
e.g., polyacrylamide electrophoresis, in the presence of specific amounts of molecular markers. 

25 After electrophoresis, protein bands or spots may be visualized using any number of methods 
know to those of skill in the art, including staining techniques such as Coomassie blue or silver 
staining, or some other agent that is standard in the art. Alternatively, autoradiography can be 
used for visualizing proteins isolated from organisms cultured on media containing a radioactive 
label, for example ."^^SOa^" or ^^[S]methionine, that is incorporated into the proteins. A 

30 comparison of the intensity of the band of the subject polypeptide with the molecular markers 
indicates the quantity of the subject polypeptide preparation, whereas a comparison of the 
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position of the band of the subject polypeptide with the molecular markers indicates the 
molecular weight and/or shape. Other methods for determining the amount and identity pf 
proteins include mass spectrometry, liquid chromatography and peptide sequencing according to 
methods known in the art. Such methods may be used directly to identify and quantitate the 
5 levels of protein expression, or used following gel electrophoresis or other separation step to do 
the same. 

Proteins may be isolated and quantified, for example, by use of chromatography. A soUd 
support that differentially binds the peptides and not the other compounds derived from a gel 
slice, a protease reaction or the sample may be used. The peptides may be eluted from the solid 

10 support into a small volume of a solution that is compatible with mass spectrometry (e.g. 50% 
acetomtrile/0.1% trifluoroacetic acid) or other analytical technique. 

To identify a protein by mass spectrometry, it may be desirable to reduce the disulfide 
bonds of the protein followed by alkylation of the free thiols prior to digestion of the protein with 
protease. The reduction may be performed by treatment of the gel slice with a reducing agent, 

1 5 for example with dithiothreitol, whereupon, the protein is alkylated by treating it with a suitable 
alkylating agent, for example iodoacetamide. Prior to analysis by mass spectrometry, the protein 
may be chemically or enzymatically digested. The protein sample in a gel slice may be 
subjected to in-gel digestion. Shevchenko A. et al., Mass Spectrometric Sequencing of Proteins 
from Silver Stained Polyacrylamide Gels. Analytical Chemistry 1996, 58, 850-858. One method 

20 of digestion is by treatment with the enzyme trypsin. The resulting peptides are extracted from 
the gel slice into a buffer, or purified by chromatography. The preparation of a protein sample 
from a gel slice that is suitable for mass spectrometry may also be done by an automated 
procedure. 
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In the case of polypeptides which are secreted from cells, the level of expression of these 
polypeptides may be measured in biological fluids. 
6. 5. Data analysis methods 

Comparison of the expression levels of one or more genes that are differentially 

5 expressed during CD8+ T cell priming with reference expression levels, e.g., expression levels in 
CD8+ T cells of a subject having a disorder of CD8+ T cell priming or a disease in which CD8+ 
T cell priming is a component or in normal counterpart cells, is preferably conducted using 
computer systems. In one embodiment, expression levels are obtained in two cells and these two 
sets of expression levels are introduced into a computer system for comparison. In a preferred 

10 embodiment, one set of expression levels is entered into a computer system for comparison with 
values that are already present in the computer system, or in computer-readable form that is then 
entered into the computer system. 

In one embodiment, the invention provides a computer readable form of the gene 
expression profile data of the invention, or of values corresponding to the level of expression of 

15 at least one gene differentially expressed during CD8+ T cell priming in a cell taken from a 
diseased subject. The values may be mRNA expression levels obtained from experiments, e.g., 
microarray analysis. The values may also be mRNA levels normalized relative to a reference 
gene whose expression is constant in numerous cells imder numerous conditions, e.g., GAPDH. 
In other embodiments, the values in the computer are ratios of, or differences between, 

20 normalized or non-normalized mRNA levels in different samples. 

The gene expression profile data may be in the form of a table, such as an Excel table. 
The data may be alone, or it may be part of a larger database, e.g., comprising other expression 
profiles. For example, the expression profile data of the invention may be part of a public 
database. The computer readable form may be in a computer. In another embodiment, the 

25 invention provides a computer displaying the gene expression profile data. 

In one embodiment, the invention provides a method for evaluating the similarity 
between the level of expression of one or more genes differentially expressed during CD8-I- T 
cells in a first cell, e.g., a cell of a subject, and that in a second cell, comprising obtaining the 
level of expression of one or more genes during CD8+ T cell priming in a first cell and entering 

30 fliese values into a computer comprising a database including records comprising values 
corresponding to levels of expression of one or more genes that are differentially regulated 
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during CD8+ T cell priming in a second cell, and processor instructions, e.g., a user interface, 
capable of receiving a selection of one or more values for comparison purposes with data that is 
stored in the computer. The computer may further comprise a means for converting the 
comparison data into a diagram or chart or other type of output. 

5 In another embodiment, values representing expression levels of genes that are 

differentially expressed during CD8+ T cell priming are entered into a computer system, 
comprising one or more databases with reference expression levels obtained from more than one 
cell. For example, the computer comprises expression data of cells taken from diseased and 
normal subjects. Instructions are provided to the computer, and the computer is capable of 

10 comparing the data entered with the data in the computer to determine whether the data entered 
is more similar to that of a cell from a diseased or a cell from a normal subject. 

In another embodiment, the computer comprises values of expression levels in cells of 
subjects at different stages of a disorder of CD8+ T cell priming or a disease in which CD8+ T 
cell priming is a component and the computer is capable of comparing expression data entered 

15 into the computer with the data stored, and produce results indicating to which of the expression 
profiles in the computer, the one entered is most similar, such as to determine the stage of a 
disorder of CD8+ T cell priming or a disease in which CD8+ T cell priming is a component in 
the subject. 

In yet another embodiment, the reference expression profiles in the computer are 
20 expression profiles from cells of one or more subjects having a disorder of CD8+ T cell priming 
or a disease in which CD8+ T cell priming is a component, which cells are treated in vivo or in 
vitro with a drug used for therapy of a disorder of CD8+ T cell priming or a disease in which 
CD8+ T cell priming is a component. Upon entering of expression data of a cell of a subject 
treated in vitro or in vivo with the drag, the computer is instructed to compare the data entered to 
25 the data in the computer, and to provide results indicating whether the expression data input into 
the computer are more similar to those of a cell of a subject that is responsive to the drag or more 
similar to those of a cell of a subject that is not responsive to the drag. Thus, the results indicate 
whether the subject is likely to respond to the treatment with the drag or unlikely to respond to it. 
In one embodiment, the invention provides a system that comprises a means for receiving 
30 gene expression data for one or a plxirality of genes; a means for comparing the gene expression 
data from each of said one or plurality of genes to a common reference firame; and a means for 
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presenting the results of the comparison. This system may further comprise a means for 
clustering the data. 

In another embodiment, the invention provides a computer program for analyzing gene 
expression data comprising (i) a computer code that receives as input gene expression data for a 
5 plurality of genes and (ii) a computer code that compares said gene expression data from each of 
said plurality of genes to a common reference frame. 

The invention also provides a machine-readable or computer-readable medium including 
program instructions for performing the following steps: (i) comparing a plurality of values 
corresponding to expression levels of one or more genes that are differentially expressed during 

10 CD8+ T cell in a query cell with a database including records comprising reference expression or 
expression profile data of one or more reference cells and an annotation of the type of cell; and 
(ii) indicating to which cell the query cell is most similar based on similarities of expression 
profiles. The reference cells may be cells from subjects at different stages of a disorder of CD8+ 
T cell priming or a disease in which CD8+ T cell priming is a component. The reference cells 

IS may also be cells from subjects responding or not responding to a particular drug treatment and 
optionally incubated in vitro or in vivo with the drug. 

The reference cells may also be cells from subjects responding or not responding to 
several different treatments, and the computer system indicates a preferred treatment for the 
subject. Accordingly, the invention provides a method for selecting a therapy for a patient 

20 having a disorder of CD8+ T cell priming or a disease in which CD8+ T cell priming is a 

component; the method comprising: (i) providing the level of expression of one or more genes 
that are differentially expressed during CD8+ T cell priming in a cell of a patient with a 
disease/disorder; (ii) providing a plurality of reference profiles, each associated with a therapy, 
wherein the subject expression profile and each reference profile has a plurality of values, each 

25 value representing the level of expression of a gene that is differentially regulated during CD8+ 
T cell priming; and (iii) selecting the reference profile most similar to the subject expression 
profile, to thereby select a therapy for said patient. In a preferred embodiment step (iii) is 
performed by a computer. The most similar reference profile may be selected by weighing a 
comparison value of the plurality using a weight value associated with the corresponding 

30 expression data. 
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The relative abundance of a mRNA in two biological samples may be scored as a 
perturbation and its magnitude determined (i.e., the abundance is different in the two sources of 
mRNA tested), or as not perturbed (i.e., the relative abundance is the same). In various 
embodiments, a difference between the two sources of RNA of at least a factor of about 25% 
5 (RNA from one source is 25% more abundant in one source than the other source), more usually 
about 50%, even more often by a factor of about 2 (twice as abundant), 3 (three times as 
abundant) or 5 (five times as abundant) is scored as a perturbation. Perturbations may be used by 
a computer for calculating and expression comparisons. 

Preferably, in addition to identifying a perturbation as positive or negative, it is 
10 advantageous to determine the magnitude of the perturbation. This may be carried out, as noted 
above, by calculating the ratio of the emission of the two fluorophores used for differential 
labeling, or by analogous methods that will be readily apparent to those of skill in the art. 

The computer readable medium may further comprise a pointer to a descriptor of a stage 
of or to a treatment for a disorder of CD8+ T cell priming or a disease in which CD8+ T cell 
1 5 priming is a component. 

In operation, the means for receiving gene expression data, the means for comparing the 
gene expression data, the means for presenting, the means for normalizing, and the means for 
clustering within the context of the systems of the present invention may involve a programmed 
computer with the respective functionalities described herein, implemented in hardware or 
20 hardware and software; a logic circuit or other component of a programmed computer that 
performs the operations specifically identified herein, dictated by a computer program; or a 
computer memory encoded with executable instructions representing a computer program that 
may cause a computer to function in the particular fashion described herein. 

Those skilled in the art will imderstand that the systems and methods of the present . 
25 invention may be applied to a variety of systems, including IBM-compatible personal computers 
running MS-DOS or Microsoft Windows. 

The computer may have internal components linked to external components. The 
intemal components may include a processor element intercomected with a main memory. The 
computer system may be an Intel Pentium®-based processor of 200 MHz or greater clock rate 
30 and with 32 MB or more of main memory. The extemal component may comprise a mass 

storage, which may be one or more hard disks (which are typically packaged together with the 
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processor and memory). Such hard disks are typically of 1 GB or greater storage capacity. 
Other external components include a user interface device, which may be a monitor, together 
with an inputting device, which may be a "mouse", or other graphic input devices, and/or a 
keyboard. A printing device may also be attached to the computer. 
5 Typically, the computer system is also linked to a network link, which may be part of an 

Ethemet link to other local computer systems, remote computer systems, or wide area 
communication networks, such as the Intemet. This network Hnk allows the computer system to 
share data and processing tasks with other computer systems. 

Loaded into memory during operation of this system are several software components, 

10 which are both standard in the art and special to the instant invention. These software 

components collectively cause the computer system to function according to the methods of this 
invention. These software components are typically stored on a mass storage. A software 
component represents the operating system, which is responsible for managing the computer 
system and its network interconnections. This operating system may be, for example, of the 

15 Microsoft Windows' family, such as Windows 95, Windows 98, or Windows NT. A software 
component represents conunon languages and fimctions conveniently present on this system to 
assist programs implementing the methods specific to this invention. Many high or low level 
computer languages may be used to program the analytic methods of this invention. Instructions 
may be interpreted during run-time or compiled. Preferred languages include C/C-H-, and 

20 JAVA®. Most preferably, the methods of this invention are programmed in mathematical 
software packages which allow symbolic entry of equations and high-level specification of 
processing, including algorithms to be used;, thereby freeing a user of the need to procedurally 
program individual equations or algorithms. Such packages include Matlab fix)m Mathworks 
(Natick, Mass.), Mathematica from Wolfram Research (Champaign, III), or S-Plus from Math 

25 Soft (Cambridge, Mass.). Accordingly, a software component represents the analytic methods of 
this invention as programmed in a procedural language or symbolic package. In a preferred 
embodiment, the computer system also contains a database comprising values representing levels 
of expression of one or more genes whose expression is characteristic of a disorder of CD8+ T 
cell priming or a disease in which CD8+ T cell priming is a component. The database may 

30 contain one or more expression profiles of genes whose expression is characteristic of a disorder 
of CD8+ T cell priming or a disease in which CD8+ T cell priming is a component. 
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In an exemplary implementation, to practice the methods of the present invention, a user 
first loads expression profile data into the computer system. These data may be directly entered 
by the user from a monitor and keyboard, or from other computer systems linked by a network 
connection, or on removable storage media such as a CD-ROM or floppy disk or through the 
5 network. Next the user causes execution of expression profile analysis software which performs 
the steps of comparing and, e.g., clustering co-varying genes into groups of genes. 

In another exemplary implementation, expression profiles are compared using a method 
described in U.S. Patent No. 6,203,987. A user first loads expression profile data into the 
computer system. Geneset profile definitions are loaded into the memory fi"om the storage media 
10 or firom a remote computer, preferably fi*om a dynamic geneset database system, through the 
network. Next the user causes execution of projection software which performs the steps of 
converting expression profile to projected expression profiles. The projected expression profiles 
are then displayed. 

In yet another exemplary implemaitation, a user first leads a projected profile into the 
15 memory. The user then causes the loading of a reference profile into the memory. Next, the user 
causes the execution of comparison software which performs the steps of objectively comparing 
the profiles. 

6.4. Exemplary diagnostic and prognostic compositions and devices of the invention 
Any composition and device (e.g., a microarray) used in the above-described methods are 
20 within the scope of the invention. 

In one embodiment, the invention provides a composition comprising a plurality of 
detection agents for detecting expression of genes in FIGURE 1 . In a preferred embodiment, the 
composition comprises at least 2, preferably at least 3, 5, 10, 20, 50, or 100 different detection 
agents. A detection agent may be a nucleic acid probe, e.g., DNA or RNA, or it may be a 
25 polypeptide, e.g., as antibody that binds to the polypeptide encoded by a gene listed in FIGURE 
1 . The probes may be present in equal amount or in different amounts in the solution. 

A nucleic acid probe may be at least about 10 nucleotides long, preferably at least about 
15, 20, 25, 30, 50, 100 nucleotides or more, and may comprise the fiiU length gene. Preferred 
probes are those that hybridize specifically to genes listed in FIGURE 1. If the nucleic acid is 
30 short (i.e., 20 nucleotides or less), the sequence is preferably perfectly complementary to the 

target gene (i.e., a gene that is involved in CD8+ T cell priming), such that specific hybridization 
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may be obtained. However, nucleic acids, even short ones, that are not perfectly complementary 
to the target gene may also be included in a composition of the invention, e.g., for use as a 
negative control. Certain compositions may also comprise nucleic acids that are complementary 
to, and capable of detecting, an allele of a gene. 

5 In a preferred embodiment, the invention provides nucleic acids which hybridize imder 

high stringency conditions of 0.2 to 1 x SSC at 65 °C followed by a wash at 0.2 x SSC at 65 
to genes that are differentially regulated during CD8+ T cell priming. In another embodiment, 
the invention provides nucleic acids which hybridize imder low stringency conditions of 6 x SSC 
at room temperature followed by a wash at 2 x SSC at room temperature. Other nucleic acids 

10 probes hybridize to their target in 3 x SSC at 40 or 50 "^C, followed by a wash in 1 or 2 x SSC at 
20, 30, 40, 50, 60, or 65 ^C. 

Nucleic acids which are at least about 80%, preferably at least about 90%, even more 
preferably at least about 95% and most preferably at least about 98% identical to genes involved 
in CD8+ T cell priming or cDNAs thereof, and complements thereof, are also within the scope of 

15 the invention. 

Nucleic acid probes may be obtained by, e.g., polymerase chain reaction (PCFL) 
ampUfication of gene segments from genomic DNA, cDNA (e.g., by RT-PCR), or cloned 
sequences. PCR primers are chosen, based on the known sequence of the genes or cDNA, that 
result in amphfication of unique fragments. Computer programs may be used in the design of 

20 primers with the required specificity and optimal amplification properties. See, Oligo 

version 5.0 (National Biosciences). Factors which apply to the design and selection of primers 
for amplification are described, for example, by Rylchik, W. (1993) "Selection of Primers for 
Polymerase Chain Reaction," in Methods in Molecular Biology, Vol. 15, White B. ed., Humana 
Press, Totowa, N.J. Sequences may be obtained from GenBank or other public sources. 

25 Oligonucleotides of the invention may be synthesized by standard methods known in the 

^» by use of an automated DNA synthesizer (such as are commercially available from 
Biosearch, Applied Biosystems, etc.). As examples, phosphorothioate oligonucleotides maybe 
synthesized by the method of Stein et al. (1988, NucL Acids Res. 16: 3209), methylphosphonate 
oligonucleotides may be prepared by use of controlled pore glass polymer supports (Sarin et al., 

30 1988, PNAS 85: 7448-7451), etc. In another embodiment, the oUgonucleotide is a 2'-0- 
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methylribonucleotide (Inoue et al, 1987, NucL Acids Res. 15: 6131-6148), or a chimeric RNA- 
DNA analog (Inoue et al, 1987, FEES Lett. 215: 327-330). 

Probes having sequences of genes Hsted in FIGURE 1 may also be generated 
synthetically. Single-step assembly of a gene from large numbers of oligodeoxyribonucleotides 

5 may be done as described by Stemmer et al.. Gene (Amsterdam) (1995) 164(l)'A9-53. In this 
method, assembly PGR (the synthesis of long DNA sequences from large numbers of 
oligodeoxyribonucleotides (oligos)) is described. The method is derived from DNA shuffling 
(Stemmer, Nature (1994) 370:389-391), and does not rely on DNA ligase, but instead relies on 
DNA polymerase to build increasingly longer DNA fragments during the assembly process. For 

10 example, a 1 .1-kb fragment containing the TEM-1 beta-lactamase-encoding gene (bla) may be 
assembled in a single reaction from a total of 56 oUgos, each 40 nucleotides (nt) in length. The 
synthetic gene may be PGR ampUfied and makes this approach a general method for the rapid 
and cost-effective synthesis of any gene. 

**Rapid amplification of cDNA aids," or RAGE, is a PGR method that may be used for 

1 5 amplifying cDNAs from a number of different RNAs. The cDNAs may be ligated to an 

oligonucleotide linker and amplified by PGR using two primers. One primer may be based on 
sequence from the instant nucleic acids, for which ftiU length sequence is desired, and a second 
primer may comprise a sequence that hybridizes to the oligonucleotide linker to ampUfy the 
cDNA. A description of this method is rq>orted in PGT Pub. No. WO 97/191 10. 

20 In another embodiment, the invention provides a composition comprising a plurality of 

agents which may detect a polypeptide encoded by a gene mvolved in GD8+ T cell priming. An 
agent may be, e.g., an antibody. Antibodies to polypeptides described herein may be obtained 
commercially, or they may be produced according to methods known in the art. 

The probes may be attached to a soUd support, such as paper, membranes, filters, chips, 

25 pins or glass slides, or any other appropriate substrate, such as those fiirther described herein. 
For example, probes of genes involved in GD8+ T cell priming may be attached covalently or 
non covalently to membranes for use, e.g., in dotblots, or to solids such as to create arrays, e.g., 
microarrays. 

6. 5. Alternative Diagnostic Methods 

30 In other embodiments of the diagnostic methods contemplated by the present invention, 

the method of diagnosis comprises the steps of evaluating the activity of a protein encoded by a 
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gene selected from the panels of the invention in the CD8+ T cells of a subject, and comparing 
the activity of said protein in said subject's cells with that in a cell of the same type taken from a 
nomial subject. In certain embodiments, a particular type of a disorder of CD8+ T cell priming 
or a disease in which CD8+ T cell priming is a component may be diagnosed if the protein 
5 whose activity is determined is associated with a particular type of a disorder of CD8+ T cell 
priming or a disease in which CD8+ T cell priming is a component. Assays to determine the 
activity of a particular protein are routinely used in the art, are well-known to one of skill in the 
art, and may be adapted to the methods of the present invention with no more than routine 
experimentation. 

10 

7 ThCTapeutic and Diagnostic Kits 

The present invention provides kits for treating disorders of CD8+ T cell priming or 
diseases in which CD8+ T cell priming is a component. For example, a kit may also comprise 
one or more nucleic acids corresponding to one or more genes characteristic of a disorder of 

15 CD8+ T cell priming or a disease in which CD8+ T cell priming is a component, e.g., for use in 
treating a patient having that disorder. The nucleic acids may be included in a plasmid or a 
vector, e.g., a viral vector. Other kits comprise a polypeptide encoded by a gene characteristic of 
a disorder of CD8+ T cell priming or a disease in which CD8+ T cell priming is a component or 
an antibody to a polypeptide. Yet other kits comprise compounds identified herein as agonists or 

20 antagonists of genes characteristic of a disorder of CD8+ T cell priming or a disease in which 
CD8+ T cell priming is a component. The compositions may be pharmaceutical compositions 
comprising a pharmaceutically acceptable excipient. Kits comprising any of the pharmaceutical 
compositions of the present invention are also within the scope of the invention. 

A kit may comprise a microarray comprising probes of genes that are differentially 

25 expressed during CD8+ T cell priming. A kit may comprise one or more probes or primers for 
detecting the expression level of one or more genes that are differentially expressed during CDS+ 
T cell priming and/or a solid support on which probes attached and which may be used for 
detecting expression of one or more genes that are differentially expressed during CD8H- T cell 
priming. A kit may fiirther comprise nucleic acid controls, buffers, and instructions for use. 

30 The present invention further provides a kit comprising a library of gene expression 

patterns and reagents for evaluating one or more expression levels of genes. To give but one 
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example, the expression level may be detemiined by providing a kit containing an appropriate 
assay and an appropriate microarray with an array of probes. In another embodiment, the kit 
comprises appropriate reagents for evaluating the level of protein activity in the CD8+ T cells of 
a subject. The kits may be useful for identifying subjects that are predisposed to developing a 

5 disorder of CD8+ T cell priming or a disease in which CD8+ T cell priming is a component or 
who have a disorder of CD8+ T cell priming or a disease in which CD8+ T cell priming is a 
component, as well as for identifying and validating therapeutics for a disorder of CD8+ T cell 
priming or a disease in which CD8+ T cell priming is a component. In one embodiment, the kit 
comprises a computer readable medium on which is stored one or more gene expression profiles 

10 of diseased cells of a subject having a disorder of CD8+ T cell priming or a disease in which 
CD8+ T cell priming is a component, or at least values representing levels of expression of one 
or more genes that are differentially expressed during CD8H- T cell priming. The computer 
readable medium may also comprise gene expression profiles of counterpart normal cells, 
diseased cells treated with a dmg, and any other gene expression profile described herein. The 

15 kit may comprise expression profile analysis software capable of being loaded into the memory 
of a computer system. 

A kit may comprise appropriate reagents for evaluating the level of protein activity in the 
CD8+ T cells of a subject. 

Kit components may be packaged for either manual or partially or wholly automated 
20 practice of the foregoing methods. In other embodiments involving kits, this invention 

contemplates a kit including compositions of the present invention, and optionally instmctions 
for their use. Such kits may have a variety of uses, including, for example, imaguig, diagnosis, 
therapy, and other applications. 



25 EXEMPLIFICATION 

The present invention is further illustrated by the following examples which should not 
be construed as Umiting in any way. The contents of all cited references including literature 
references, issued patents, published or non published patent applications as cited throughout this 
^plication are hereby expressly incorporated by reference. The practice of the present invention 

30 will employ, unless otherwise indicated, conventional techniques of cell biology, cell culture, 
molecular biology, transgenic biology, microbiology, recombinant DNA, and immunology, 
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which are within the skill of the art. Such techniques are explained folly in the literature. (See, 
for example, Molecular Cloning A Laboratory Manual, 2nd Ed., ed. by Sambrook, Fritsch and 
Maniatis (Cold Spring Harbor Laboratory Press: 1989); DNA Cloning, Volumes I and II (D. N. 
Glover ed., 1985); Oligonucleotide Synthesis (M. J. Gait ed., 1984); Mullis et al. U.S. Patent No: 

5 4,683,195; Nucleic Acid Hybridization (B. D. Hames & S. J. Higgins eds. 1984); Transcription 
And Translation (B. D. Hames & S. J. Higgins eds. 1984); (R. L Freshney, Alan R. Liss, Inc., 
1987); Immobilized Cells And Enzymes (IRL Press, 1986); B. Perbal, A Practical Guide To 
Molecular Cloning (1984); the treatise, Methods In Enzymology (Academic Press, Inc., N.Y.); 
Gene Transfer Vectors For Mammalian Cells (J. H. Miller and M. P. Calos eds., 1987, Cold 

10 Spring Harbor Laboratory); , Vols. 154 and 155 (Wu et al. eds.). Immunochemical Methods In 
Cell And Molecular Biology (Mayer and Walker, eds.. Academic Press, London, 1987); 
Handbook Of Experimental Immunology, Volumes I-IV (D. M. Weir and C. C. Blackwell, eds., 
1986) (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y,, 1986). 

15 Example 1 : Production of Naive and Primed CD8+ T cells 

Naive cells were freshly isolated from the spleens of 4-6 week-old 2C/RAG2"^' mice 
using a negative selection protocol. These mice expressed a transgenic T cell receptor specific 
for a-ketoglutarate dehydrogenase. Spleens from the mice were macerated and washed once 
with DMEM containing 10% PCS. 2C CD8+ T cells were enriched using StemSep™ 

20 Enrichment Cocktail for Murine CD8+ T cells (Stem Cell Technologies, Vancouver, BC, 
Canada) according to manufacturer's instructions. 

To generate primed cells, 10^ freshly purified 2C/RAG2"^' CD8+ T cells were plated with 
5 X 10^ P815.B7-1 cells pretreated with mitomycin C (50 mg/mL/10^ cells for 90 min. at 3TC; 
Sigma, St. Louis, MO) in 1.5 mL DME, in each well of a 24-well plate. P815.B7-1 is an 

25 alloantigen-bearing cell line, that was engineered to express the co-stimulatory molecule B7, 1 . 
Primed cells were collected after 5 days co-culture with the mitomycin C-treated P815-B7.1 
cells. Remaining cell debris in the culture was eliminated using FicoU-Hypaque centrifugation. 
The isolated primed 2C CD8+ T cells were assayed and found to have ftmctional cytotoxic T cell 
activity. In certain examples, cells primed for a second 4-5 day period were used as indicated. 

30 A schematic of the above-described process is shown in FIGURE 5. 
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Example 2: Characterization of Naive and Primed 2C CD8+ T cells 
2. a, f^HJ 'Thymidine Incorporation 

3x10"* purified 2C CD8+ T cells were simulated with 3x10"* mitomycin C-treated 
P815.B7-1 cells in a 96-well microliter plate. At various times after plating, 1 mCi [methyl'-^H] 
5 thymidine (Amersham Pharmacia Biotech, Inc. Piscataway, NJ) was added to the cells. After 6 
hours, the plates were frozen until ready to harvest. The wells were harvested for determination 
of [^H]-thymidine incorporation using a Packard cell harvester and plate reader (Packard, 
Meriden, CT). 

Stimulation of purified naive 2C CD8+ T cells generated as described in Example 1 with 

10 mitomycin C-treated P81 5-B7. 1 cells resulted in vigorous thymidine incorporation that peaked 
on day 3, with an expansion of viable T cell numbers that lagged behind by 24 hours. By day 5, 
the cells re-entered a resting state, at which time they did not proUferate or spontaneously 
produce cytokines. The primed cells used in Examples 3 and 4 were taken on the fifth day of 
stimulation. 

15 2.b. Cell Counts 

At each day of priming, cells grown in a 24-well plate were collected fi^om three wells 
and stained with trypan blue. The average number of trypan blue-negative cells was calculated. 
Cells were dually stained with PE-anti-CD8 (Pharmingen, La JoUa, CA) and FITC-anti-K*^ 
(Pharmingen, La JoUa, CA). to prevent any remainmg live P815.B7-1 cells firom interfering with 

20 an accurate count of the cells. After the exclusion of propidium iodide (PI) - positive cells, the 
percentage of CD8-positive and K*^-positive cells was determined. After day 1 of priming, no 
viable K^-positive cells remained in the culture. To obtain the number of live T cells, the 
percentage of CD8 positive cells was then multiplied by the number of trypan-blue negative 
' cells. Two days after priming, >98% of live cells stained positive for CD8. 

25 Zc. FACS Analysis 

To confirm the naive and primed states of the cells, they were stained for cell surface 
maricers CD44 (PE-anti-CD44, Pharmingen, La JoUa, CA) and CD62L (PE-anti-CD62L, 
Pharmingen, La JoUa, CA) at a 1:80 dilution. Potential non-specific binding was blocked with 
anti-Fc receptor monoclonal antibody 2.4G2. Single-color analysis was carried out using a 

30 FACScan™ flow cytometer and CellQuest™ software (BD Biosciences, San Jose, CA). 
Expression of CD44 and CD62L was verified prior to each experiment. 
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Phenotypically, naive 2C CD8+ T cells generated as in Example 1 were observed to be 
CD44*° and CD62L*'\ whereas the majority of effector cells were observed to be CD44^' and 

2.d. Cytokine Production ELISA 
5 For cell stimulation, 96-well flat-bottom plates (Costar) were coated with varying 

amounts of anti-CD3 (2C1 1) and 1 p.g/mL anti-CD28 (PV-1) overnight at 4*^0. Supematant was 
collected and assessed by ELISA. 

Primed effector 2C CD8+ T cells generated as in Example 1 were observed to routinely 
produce the cytokines IL-2 and IFN-y, 
10 2.e. Chromium Release Assay 

To assess cytolytic activity, 1.5 x 10^ P815.B7-1 (specific targets) or EL4 (syngeneic 
control targets) cells were labeled with ^^Cr-sodium chromate for 1.5 hours at 37T. After three 
washes, 2x10^ labeled targets were plated with 2x10^ naive or effector CD8+ T cells in a 96-well 
V-bottom plate (ICN Biomedicals, Costa Mesa, CA) and centrifuged for 5 min at 1500 rpm in a 
15 Sorvall RT6000B to allow the cells to come into contact. After 4 hour incubation under 
37**C/8% C02 conditions, 50 jiL supematant was transferred to a LumaPlate-96 (Packard, 
Meriden, CT) and allowed to dry. Plates were then counted using TopCoimt-NXT (Packard, 
Meriden, CT). 

Primed effector 2C CD8+ T cells generated as in Example 1 were observed to acquire 
20 antigen-specific cytolytic activity. 

Example 3 : Expression Profiling Protocol 

Differential gene expression in cells during priming of 2C CD8+ T cells was detected in 
homogeneous samples of the naive and primed 2C CD8+ T cells prepared above. 
25 3. L Preparation of RNA for Gene Chip Analysis 

Total RNA was purified from naive and primed CD8+ T cells. Cells were first subjected 
to FicoU-Hypaque centrifiigation to remove dead cells, then remaining cells in the interface were 
isolated, washed, and lysed in Trizol® (Invitrogen, Grand Island, NY) Total RNA was then 
treated with DNase I, and fiuther purified using the Qiagen RNeasy® Clean-Up protocol. To 
30 verify the integrity of the isolated RNA, ahquots of each sample were electrophoresed in 1% 
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denaturing agarose gels. Samples that exhibited an intact 28S and 18S ribosomal band were 
selected for generation of probes. 

3.2. Gene Chip Array Analysis 

The RNAs were prepared for Affymetrix microarray analysis using materials and 
5 methods provided by Affymetrix. (Mahadevappa, M. and Warrington, J. A., (1999) Nat, 

Biotechnol. 17:1 134-1 136). Briefly, cDNA was made with the SuperScriptTM Choice System 
(Life Technologies, Grand Island, NY) using 1 mL of 100 mM oligo T7-dT24 primer. The 
double-stranded cDNA was extracted with phenol:chlorofomi:isoamyl alcohol, precipitated with 
0.5 voltunes of 7.5 M ammoniiun acetate and 2.5 volumes of 100% ethanol. In vitro 

10 transcription with biotin-labeled ribonucleotides was performed with the ENZO Bio Array High 
Yield RNA Transcript Labeling Kit (Enzo Diagnostics, New York, NY). Labeled in vitro 
transcripts were purified over RNeasy® mini columns (Qiagen, Valencia, CA) according to 
manufacturer's instructions. 

An Affymetrix gene chip array containing approximately 13,000 genes was used in two 

15 independent screens. 12.5 ^g of purified, labeled cRNA transcript was used to hybridize with 
each Affymetrix array according to manufacturer's instructions. The cRNAs were fi-agmented 
and hybridized to the microarray overnight. The hybridized array was stained with SAFE 
(streptavidin-phycoerythrin). The hybridization levels (e.g. SAFE fluorescence) were measured 
using a Hewlett-Packard GeneArray scanner. Initial data analysis was perfomied using 

20 Spotfire® Array Explorer™ (Spotfire, Somerville, MA) software. 

The relative abundance of an mRNA in the two samples was scored and its magnitude 
detemiined (i.e., the abundance is different in the two sources of mRNA tested), or as not 
changed (i.e., the relative abundance is the same). As used herein, a difference between RNA 
derived from naive and primed cells is at least a factor of about 2 (twice as abundant) in two 

25 different samples. Present detection methods allow reliable detection of difference of an order of 
about 2-fold to about 5-fold, but more sensitive methods that will distinguish lesser magnitudes 
of perturbation are in development. Genes that were present in both sets and had values more 
than 2 were chosen for the list of genes in FIGURE 1 . 



-90- 



Example 4 Validation of Gene Chip Array Analysis 

4.a. RT'PCR 

Gene chip array results were confirmed by semi-quantitative RT-PCR. 20 |ig total RNA 
was used for first strand cDNA synthesis. Samples were normalized based on equivalent p-actin 
5 expression by RT-PCR. For each PGR amplification, 1 jxL cDNA was used in a 50 jxL reaction. 
To ensure that amplification remained within the linear range, 1:5 serial dilutions of cDNA were 
made. RT-PCR for P-actin (25 cycles) was used as a control for mRNA abundance. The 
number of cycles ranged from 25 to 45. Annealing temperatures varied, since the Tm of each 
primer differed, and amplification was carried out at 72''C. RT-PCR for each gene was 
10 performed several times using different batches of cDNA. 
4.b, SDS-PAGE 

Cells were lysed in eitiier Triton lysis buffer [50 mM Tris pH 7.6, 150 mM NaCl, 0.5% 
Tirton X-100, 5 mM EDTA, ImM trypsin inhibitor, ImM benzamidine, 1 mM sodium 
orthovanadate (Na3V04), 10 \xg/mL aprotinin, 25 jiM p-nitrophenyl-p'guanidinobenzoate 

15 (NPGB), 1 mM NaF, 1 mM PMSF], or RIPA buffer [Ix PBS, 1% Igepal CA-630, 0.5% sodium 
deoxycholate, 0.1% SDS, 5 mM EDTA, 1 mM trypsin inhibitor, 1 mM benzamindine, ImM 
sodium orthovanadate, 10 fig/mL aprotinin, 25 ^iM NPGB, 1 mM NaF, 1 mM PMSF]. Cytosolic 
lysates from 3-5 xlO^ naive and primed 2C/RAG2"'' CD8+ T cells were subjected to SDS-PAGE 
on 10-15% acrylamide gels. Lysates from equal numbers of cells were always loaded on the 

20 same blot. Total protein content in naive and primed cells was measured using DC protein 
Assay Reagent (Bio-Rad, Hercules, CA), 

FIGURE 6 depicts the differential expression of putative anti-proliferative genes in naive 
and primed 2C CD8+ T cells as detected by the RT-PCR method. Each reaction contained 
decreasing amounts of cDNA (1 :5 serial dilution), as indicated by the descending triangle. (A) 

25 BTG/TOB family. (B) TSC-22 and G0S8. Both TSC-22 and G0S8 were represented on the 
microarrays, as indicated (arbitrary units of expression). Values represent averages &om two 
independent gene array screens. 

FIGURES 7-9 depict the validation of gene array results by RT-PCR for selected 
cytoskeleton-related proteins (FIGURE 7), selected adapter proteins, signaling molecules, and 

30 transcription factors (FIGURE 8), and selected effector function-, metabolism-, and 

cytoskeleton-related genes (FIGURE 9). FIGURES 7-9 A depict the average numbers(in 
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arbitrary units of expression) from two independent gene array screens of naive and effector 2C 
CDS^T cells. NOC, not represented on chip. FIGURES 7-9 B depict RNA expression was 
verified by semiquantitative RT-PCR as described in Materials and Methods. Naive and effector 
cDNA were normalized based on jS-actin expression. Each reaction contained decreasing . 
5 amounts of cDNA (1:5 serial dilution), as indicated by the descending triangle. Equal numbers 
of naive and effector cells were lysed using Triton or RIPA lysis buffer. Lysates from 3x10^ 
cells were loaded for each sample. 

FIGURE 3 depicts certain of these genes whose differential expression was validated by 
RT-PCR, classified by their biological function and whether or not they were up or 
10 downregulated during 2C CD8-f T cell priming. These results are exemplary and are not intended 
to limit the scope of the invention. 

EQUIVALENTS 

The present invention provides among other things novel panels of molecular targets that 
15 are differentially expressed during CD8+ T cell priming. While specific embodiments of the 
subject invention have been discussed, the above specification is illustrative and not restrictive. 
Many variations of the invention will become apparent to those skilled in the art upon review of 
this specification. The fiill scope of the invention should be determined by reference to the 
claims, along with their full scope of equivalents, and the specification, along with such 
20 variations. 

All pubUcations and patents mentioned herein, including those items listed below, are 
hereby incorporated by reference in their entirety as if each individual publication or patent was 
specifically and individually indicated to be incorporated by reference. In case of conflict, the 
present application, including any definitions herein, will control. 
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